FIG. 1 

AlaSerCysLeuAsnCysSerAlaSerllelleProAspArgGluValLeuTyrArgGlu 
1 GGCCTCCTGCTTGAACTGCTCGGCGAGCATCATACCTGACAGGGAAGTCCTCTACCGAGA 
CCGGAGGACGAACTTGACGAGCCGCTCGTAGTATGGACTGTCCCTTCAGGAGATGGCTCT 

PheAspGluMetGluGluCysSerGlnHisLeuProTyrlleGluGlnGlyMetMetLeu 
6 1 GTTCGATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCT 
CAAGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGA 

AlaGluGlnPheLysGlnLysAlaLeuGlyLeu 
121 CGCCGAGCAGTTCAAGCAG AAGGCCCTCGGCCTCC 
GCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGG 



FIG. 3 

GlyCysValVallleValGlyArgValValLeuSerGlyLysProAlallelleProAsp 
1 CTGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAATCATACCTG 
GACCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTAGTATGGAC 

T 

ArgGluValLeuTyrArgGluPheAspGluMetGluGluCysSerGlnHisLeuProTyr 
6 1 ACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCACTTACCGT 
TGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCA 

A 

IleGluGlnGlyMetMetLeuAlaGluGlnPheLysGlnLysAlaLeuGlyLeuLeuGln 
121 ACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCCTGC 
TGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGACG 

ThrAlaSerArgGlnAlaGluValZleAlaProAlaValGlnThrAsnTrpGlnLysLeu 
181 AGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTGGCAAAAAC 
TCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTG 

GluThrPheTrpAlaLysHisMetTrpAsnPhelleSerGlylleGlnTyrLeuAlaGly 
24 1 TCG AGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATACTTGGCGG 
AGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATGAACCGCC 

LeuS erThr LeuP roGlyAs n ProAla I leAl aSer LeuMe tAl a PheThr Al a Al a Va 1 
301 GCTTGTCAACGCTGCCTGGTAACCGCGCCATTGCTTCATTGATGGCTTTTACAGCTGCTG 
CGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGTCGACGAC 

ThrSerProLeuThrThrScrGln 
361 TCACCAGCCCACTAACCACTAGCCAAA 
AGTGGTCGGGTGATTGGTGATCGGTTT 
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FIG. 6 



AspAlaHlsPheLeuSerGlnThxLysGLnSerClyGLuAsnLeuProTyrLeuValAla 
1 GATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCG 
CTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGC 

TyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGLnMetTrp 
61 TACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGG 
ATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACC 

LysCysI^uIleArgLeuLysProThxLeuHisGlyProThrProLeuLeuTyrArgLeu 
1 2 L AAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCC AACACCCCTGCTATAC AGACTG 
TTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGAC 

GlyAlaValGlrLAsnGluIleThrLcuThrHisProValThrLysTyrlleMetThrCys 
181 GG CG CTG TTCAG AA TG AAATC AC CCTGACGCACCCAGTCACC AAA T AC ATC A TG AC ATGC 
C CG CG AC AAG TCTT ACTTT AGTGGG ACTGC GTGGGTC AG TGGTTT A TGT AG T AC TGT AC G 

MetSerAlaAspLeuGluValValThrSerThrTrpValLeuValGlyGlyValLeuAla 
24 1 ATGTCGGCCGACCTGGAGGTCGTCACG AGC ACCTGGGTGCTCGTTGGCGGCGTCCTGGCT 
TACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGA 

AlaLeuAlaAlaTy rCysLeuSerTKrC LyCys Va lVa 1 1 leva 1G 1 yArg Va 1 Va lLeu 
301 GCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTG 
CGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAAC 

SerGlyLysProAlallelleProAspArgGluValLeuTyrArgGluPheAspGluMei: 
361 TCCCGG AAGCCGGCAATCATACCTGACAGGG AAGTCCTCTACCGAGAGTTCGATG AG ATG 
AGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTAC 

GluGluCysSerGlnHisLeuProTyrlleGluGInGlyMetMetLeuALaGluGlnPhe 
4 21 G AAGAGTGCTCTCAGCACTTACCGTACATCGAGC AAGGG ATG ATGCTCGCCGAGCAGTTC 
CTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAG 

LysGlnLysAlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaPro 
4 81 AAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCT 
TTCGTCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGA 

AlaValGlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhe 
541 GCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGG AACTTC 
CGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAG 

IleSerGlylleGlnTyrLeuALaGlyLeuScrThrLeuProGlyAsnProAlalleAla 

601 ATCAGTGGGATACAATACTTGGCGGGCTTGTG^ 

TAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGA 

SerLeuMetAlaPheThrAlaAlaValThrSerProLeuThrThrSerGln 
661 TCATTGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAA 
AGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTT 



FIG. 7 



Overlap with 81 

PheThrAlaAlaValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsnlleLeu 
1 CTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATAT . 
GAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATA 

GlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheValGlyAla 
61 TGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCG 
ACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGC 

GlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAspIleLeu 
121 CTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCC 
GACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGG 

AlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAlaPheLysIleMetSerGlyGlu 
1 8 1 TTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTG 
AACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCAC 

ValProSerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGlyAlaLeu 
241 AGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCC 
TCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGG 

ValValGlyVaiyalCysAlaAlalleLeuArgArgHisValGlyProGlyGluGlyAla 
301 TCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGG 
AGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCC 

ValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
3 6 1 CAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCC 
GTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGG 
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FIG. 9A 

Serl leGl uTh r 1 1 eThr LeuPr oG 1 nAspA 1 aVa 1 Se r A rgTh rG 1 nA rg A rgG 1 yA rg 
1 TCCATTGAGACAATCACGCTCCCCCAGGATGCTGT^ 

AGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCC 

c i JprGl-yArgGlyLysProGly 1 1 eTy rA rgPheVa 1 A 1 aProG 1 yG 1 uA rgProSe rG 1 y 
61 ACTGGCAGGGGGAAGCCAM 

TGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCG 
MetPheAspSe^ 

ATGTTCGAqCGTCCGTCCTCTGTGA 

TACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAG 
ThrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThrProGlvLeuProVal 

AGGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGG 

, ni HJs^LeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrLeuValAlaTyrGln 
301 CACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCT^ 

GTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTT 

■m AlfMy5l^^ laAr 9 AlaGlnAlaPr °ProProSerTrpAspGlnMetTrpLysCys 
361 GCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGAT^ 

CGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACA 

LeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeuTryArgLeuGlyAla 
TTGATTCGC^CAAGCCCACa 

AACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGA 

Va 1 G 1 nAsnG 1 u 1 1 eTh rLeuTh rH i sProVa ITU rLysTy r 1 1 efle tTh rCysMe tSe r 
GTTCAGAATGAAATCACC^GA^ 

CAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGC 

Al a AspLeuG 1 uVa 1 Va 1 ThrSe rTh rTrpVa 1 LeuVa 1 G 1 yG 1 y Va 1 LeuA 1 a A 1 aLeu 
GCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTG6CGGCGTCCTGGCTGCTTTG 
CGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAAC 

„ , AlaAl aTy rCysLeuSe rTh rG 1 yCys Va lVal I leValGlyA rgVa 1 Va 1 LeuSe rG 1 y 
601 GCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGT 

CGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCC 

CC i U^^ifU?U??C2^?R Ar 9 GluValLeuT y rAr 9GluPheAspGluMetGluGlu 
661 AAGCCGGCAATCATACCTGACAGGGAAGT^ 

TTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTC 

CysSerGlnHisLeuProTym 
721 IGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGA 

ACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTC 

™, LysAlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVal I leAlaProAlaVal 
781 AAGGCCaCGGCaCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTC 
TTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAG 
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FIG. 9B 

GlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHIsMetTrpAsnPhelleSer 
841 CAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGT 
GTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCA 

_ GlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeu 
901 GGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTG 
CCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAAC 

^ MetA 1 aPheTh rA 1 aA 1 aVa 1 Th rSe rProLeuTh rTh rSerG 1 nTh rLeuLeuPheAsn 
961 ATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAAC 
TACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGACAAGTTG 

IleLeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheVal 
1021 ATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTG 
TATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACAC 

GlyAlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAsp 
1081 GGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGAC 
CCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTG 

ii( ^ IleLeuAlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAlaPheLysIleMetSer 
1141 ATCCTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGC 
TAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCG 

GlyGluValProSerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGly 
1201 GGTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGA 
CCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCT 

_ AlaLeuValValGlyValValCysAlaAlalleLeuArgArgHisValGlyProGlyGlu 
1261 GCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAG 
CGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTC 

GlyAlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
1321 GGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCC 
CCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGG 
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FIG. 1 1 

Overlap with 32 

MetAsnArgLeuHeAlaPheAlaSerArgGlyAsnHisValSerProThrHisTy 
1 GATGAACCGGCTGATAGCCTTCGCCTCCCG6GGGAACCATGTTTCCCCCAC6CACTAC6T 
CTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGATGCA 

c i r &9§ I uSerAspAlaAlaAlaArgValTh r A 1 a 1 1 eLeuSe rSerLeuTh rVa 1 Th rG 1 n 
61 GCCGGAGAGCGATGCAGCTGCCC^ 

CGGCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGTCGGAGTGACATTGGGT 

10 . r LeuLeuArgArqLeuHisGl^ 

121 GCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCACTCCATGCTCCGGTTC 
CGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCCAAG 

ioi rTfPt?y A ^?RU^R^Rl^RU eC y sGluvalLeuSerA spPheLysThrTrpLeu 
181 CTGGCTAAGGGACATCTGGGACT^ 

GACCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGACCGA 

241 AAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCTTTGTGTCCTGCCAGCGCGGGTA 
TTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACAGGACGGTCGCGCCCAT 

LysGlyValTrpArgVal 
301 TAAGGGGGTCTGGCGAGTG 
ATTCCCCCAGACCGCTCAC 
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FIG. 13 



CysSerLeuThrValThrGlnLeuLeuArgArgLeuHisGlnTrpIleSerSerGluCys 
1 ACTGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGT 
TGACGTCGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCA 



ThrThrProCysSerGlySerTrpLeuArgAspIleTrpAspTrpIleCysGluValLeu 
61 GTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGT 
CATGGTGAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCCTGACCTATACGCTCCACA* 

Overlap with 3 3b 

SerAspPheLysThrTrpLeuI-ysAlaLysLeuMetProGlnLeuProGlylleProPhe 
121 TGAGCG ACTTTAAG ACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCT 
ACTCGCTGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGA 



ValSerCysGlnArgGlyTyrLysGlyValTrpArgGlyAspGlylleMetHisThrArg 
181 TTGTGTCCTGCCAGCGCGGGTATAAGGGGGTCTGGCG AGGGGACGGCATCATGCACACTC 
AACACAGGACGGTCGCGCCCATATTCCCCCAGACCGCTCCCCTGCCGTAGTACGTGTGAG 

CysHisCysGlyAlaGluileThrGlyHisValLysAsnGlyThrMetArglleValGly 
241 GCTGCCACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGACGATGAGGATCGTCG 
CGACGGTGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTACTCCTAGCAGC 

ProArgThrCysArgAsnMetTrpSerGlyThrPheProIleAsnAlaTyrThrThrGly 
301 GTCCTAGG ACCTGCAGG AAC ATGTGG AGTGGG ACCTTCCCCATTAATCCCTACACC ACGG 
CAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCC 

ProCysThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpArgValSerAlaGlu 
361 GCCCCTG.TACCCCCCTTCCTGCGCCGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAG 
CGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTC 

GluTyrValGluIleArgGlnValGlyAspPheHisTyrValThrGlyKetThrThrAsp 
4 21 AGGAATATGTGGAGATAAGGCAGGTGGGGGACTTCCACTACGTGACGGGTATGACTACTG 
TCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGAC 

AsnLeuLysCysProCysGlnValProSerProGluPhePheThrGlu 
4 81 ACAATCTCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTCACAGAAT 
TGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTA 



FIG. 14A 

A 1 aTy rMe tSe rLy sA 1 aH i sG 1 y 1 1 eAspProAsn 1 1 e A rgTh rG 1 yVa 1 A rgTh rile 
1 TGCTTACATGTCCAAGGCTCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAAT 
ACGAATGTACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTA 

ThrThrGlySerProIleThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCys 

61 TACCACTGGCAGCCCCATCACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTG 
ATGGTGACCGTCGGGGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCAC 

SerGlyGlyAlaTyrAspIlellelleCysAspGluCysHisSerThrAspAlaThrSer 
121 CTCGGGGGGCGCTTATGACATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATC 
GAGCCCCCCGCGAATACTGTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAG 

1 1 eLeuG 1 y 1 1 eG 1 yTh rVa 1 Leu AspG 1 nA 1 aG 1 uThrA 1 aG 1 yA 1 a A rgLeuVa 1 Va 1 
181 CATCTTGGGCATCGGCACTGTCCTTGACCAAGCAGAGACTGCGGGG6CGAGACTGGTTGT 
GTAGAACCCGTAGCCGTGACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACA 

LeuAlaThrAlaThrProProGlySerValThrValProHisProAsnlleGluGluVal 
241 GCTCGCCACCGCCACCCCTCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGT 
CGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCA 

A 1 aLeuSe rTh rTh rG 1 yG 1 u 1 1 eProPheTy rG 1 yLy s A 1 a 1 1 eP roLeuG 1 uVa 1 1 1 e 
301 TGCTCTGTCCACCACCGGAGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAAT 
ACGAGACAGGTGGTGGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTA 

LysGlyGlyArgHisLeul lePheCysHisSerLysLysLysCysAspGluLeuAlaAla 

361 CAAGGGGGGGAGACATCTCATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGC 
GTTCCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCG 

LysLeuValAlaLeuGlylleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVal 
421 AAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGT 
TTTCQACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCA 

1 1 eP roTh rSe r G.l y A spVa 1 Va 1 Va 1 Va 1 A 1 aTh r A sp A 1 a Le uMe tTh rG 1 yTy r Th r 
481 CATCCCGACCAGCGGCGATGTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATAC 
GTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATATG 

G 1 yAspPheAspSe rVa I I 1 e AspTy r AsnTh rCysVa 1 Th rG 1 nTh rVa 1 AspPheSe r 
541 CGGCGACTTCGACTCGGTGATAGACTACAATACGTGTGTCACCCAGACAGTCGATTTCAG 
GCCGCTGAAGCTGAGCCACTATCTGATGTTATGCACACAGTGGGTCTGTCAGCTAAAGTC 

LeuAspProThrPheThrlleGluThrlleThrLeuProGlnAspAlaValSerArgThr 
601 CCTTGACCCTACCTTCACCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCAC 
GGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTG 

GlnArgArgGlyArgThrGlyArgGlyLysProGlylleTyrArgPheValAlaProGly 
661 TCAACGTCGGGGCAGGACTGGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGG 
AGTTGCAGCCCCCTCCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCC 

G 1 uA rgProSerG 1 yMetPheAspSerSerVa 1 LeuCysG 1 uCysTy rAspA laG 1 yCys 
721 GGAGCGCCCCTCCGGCATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTG 
CCTCGCGGGGAGGCCGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGAC 

AlaTrpTyrGluLeuThrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThr 
781 TGCTTGGTATGAGCTCACGCCCGCGGAGACTACAGTTAGGCTACGAGCGTACATGAACAC 
ACGAACGATACTCGAGTGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGTG 

P roG lyLeuProVa 1 CysG 1 nAspH IsLeuG 1 uPheT rpG 1 uG 1 y Va 1 PheThrG i y Leu 
841 CCCGGGGCTTCCCGTGTGCCAGGAGCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCT 
GGGCCCCGAAGGGGACACGGTCCTGGTAGAACTTAAAACCCTCCCGGAGAAATGTCCG6A 



> 



FIG. 14B 

Th rH i s 1 1 eAspA 1 aH i SPheLeuSerG 1 nTh rLysG 1 nSe rG 1 yG 1 uAspLeuProTy r 
901 CACTCATATAGATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTA 
GTGAGTATATCTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAAT 

LeuValAlaTyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAsp 
961 CCTGGTAGCGTACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGA 
GGACCATCGCATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCT 

GlnMetTrpLysCysLeuI leArgLeuLysProThrLeuHisGlyProThrProLeuLeu 
1021 CCAGATGTGGAAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCT 
GGTCTACACCTTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGA 

TyrArgLeuGlyAlaValGlnAsnGluI leThrLeuThrHisProValThrLysTyrl le 
1081 ATACAGACTGGGCGCTGTTCAGAATGAAATCACCCTGACGCACCGAGTCACCAAATACAT 
TATGTCTGACCCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTA 

MetThrCysMetSerAlaAspLeuGluValValThrSerThrTrpValLeuValGlyGly 
1111 CATGACATGCATGTCGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGG 
GTACTGTACGTACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCC 

ValLeuAlaAlaLeuAlaAlaTyrCysLeuSerThrGlyCysValVallleValGlyArg 
1201 CGTCCTGGCTGCTTTGGCCGCGTATTGCGTGTCAACAGGCTGCGTGGTCATAGTGGGCAG 
GCAGGACCGACGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTC 

Va 1 Va 1 LeuSe rG 1 yLy s P ro A 1 a 1 1 e 1 1 ePr oA spA rgG 1 uVa 1 LeuTy r A r gG 1 uPhe 
1261 GGTCGTCTTGTCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTT 
CCAGCAGAACAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAA 

AspGluMetGluGluCysSerGlnHisLeuProTyrlleGluGlnGlyMetMetLeuAla 
1321 CGATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGC 
GCTACTGTACCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCG 

G 1 uG 1 nPheLy sG 1 nLysA 1 aLeuG 1 y LeuLe uG 1 nTh rA 1 aSe r A rgG 1 n A 1 aG 1 uVa 1 
1381 CGAGCAGTTCAAGCAGAAGGCCCTCGGCCTGCTGCAGACCGCGTCCCGTCAGGCAGAGGT 
GCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCA 

ileAlaProAlaValGlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHlsMet 
mm TATCGCCCCTGCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATAT 
ATAGCGGGGACGACAGGTCTGGTTGACGGTTTTTGAGCTCTGGAAGACCCGCTTCGTATA 

TrpAsnPhelleSerGlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnPro 
1501 GTGGAACTTCATCAGTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCC 
CACCTTGAAGTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGG 

AlalleAlaSerLeuMetAlaPheThrAlaAlaValThrSerProLeuThrThrSerGln 
1561 CGCCATTGCTTCATTGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCA . 
GCGGT ( AACGAAGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGT 

ThrLeuLeuPheAsnlleLeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAla 
1621 AACCCTCCTCTTCAACATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGC 
TTGGGAGGAGAAGTTGTATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACG 

AlaThrAlaPheValGlyAlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGly 
1681 CGCTACTGCCTTTGTGGGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGG 
GCGATGACGGAAACACCCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCC 



) 



FIG. 14C 

1741 GAAGGTCCTCATAGACATCCTTGCAGG6TATGGCGCGG6CGTGGCGGGAGCTCTTGTGGC 
CTTCCAGGAGTATCTGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCG 

TAAGTTCTAGTACTCGCCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCG 

, nr , 1 1 eLeuSe rProG 1 yA 1 aLeuVa 1 Va 1 G 1 yVa 1 Va 1 Cys A 1 aA 1 a 1 1 eLeuA rgA rgH i s 
1861 CATCCTCTCGCCCGGAGCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCA 
GTAGGAGAGCGGGCCTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGT 

ValGlyProGlyGluGlyAlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArg 
1921 CGTTGGCCCGGGCGAGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCG 
GCAACCGGGCCCGCTCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGC 

1rt01 GlyAsnHisValSerProThrHisTyrValProGluSerAspAlaAlaAlaArgValTh 
1981 GGGGAACCATGTTTCCCCCACGCACTACGTGCCGGAGAGCGATGCAGCTGCCCGCGTCAC 
CCCCTTGGTACAAAGGGGGTGCGTGATGCACGGCCTCTCGCTACGTCGACGGGCGCAGTG 

Alal leLeuSerSerLeuThrValThrGlnLeuLeuArgArgLeuHisGlnTrpI leSer 
2041 TGCCATACTCAGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAAG 
ACGGTATGAGTCGTCGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATTC 

SerGluCysThrThrProCysSerGlySerTrpLeuArgAspI leTrpAspTrpI leCys 
2101 CTCGGAGTGTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATATG * 
GAGCCTCACATGGTGAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCCTGACCTATAC 

^.GluValLeuSerAspPheLysThrTrpLeuLysAlaLysLeuMetProGlnLeuProGly 
2161 CGAGGTGTTGAGCGACTTTAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTGG 
GCTCCACAACTCGCTGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGACC 

IleProPheValSerCysGlnArgGlyTyrLysGlyValTrpArgValAspGlylleMet 
2221 GATCCCCTTTGTGTCCTGCCAGCGCGGGTATAA6GGGGTCTGGCGAGTGGACGGCATCAT 
CTAGGGGAAACACAGGACGGTCGCGCCCATATTCCCCGAGACCGCTCACCTGCCGTAGTA 

nnn , M lTd rArgCysH i sCysG lyAlaGluI 1 eTh rG 1 yH i sVa 1 Lys AsnG 1 yTh rMetA rg 
2281 GCACACTCGCTGCCACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGACGATGAG 
CGTGTGAGCGACGGTGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTACTC 

I leValGlyProArgThrCysArgAsnMetTrpSerGlyThrPheProI leAsnAlaTyr 
2341 GATCGTCGGTCCTAGGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCTA 
CTAGCAGCCAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGAT 

Th rTh rG 1 yProCy sThrProLeuProA 1 aProAsnTy rThrPheA 1 aLeuTrpA rgVa 1 
2401 CACCACGGGCCCCTGTACCCCCCTTCCTGCGCCGAACTACACGTTCGCGCTATGGAGGGT 
GTGGTGCCCGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCA 

ni _ SerAlaGluGluTyrValGluI leArgGlnValGlyAspPheHisTyrValThrGlyMet 
2461 GTCTGCAGAGGAATATGTGGAGATAAGGCAGGTGGGGGACTTCCACTACGTGACGGGTAT 
CAGACGTCTCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATA 

.„ , ^T^TUCft§R A ?DUylry sC y sProC y sG 1 nV a l ProSerProG 1 uPhePheTh rG 1 u 
2521 GACTACTGACAATCTCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTCACAGAAT 
CTGATGACTGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTA 



FIG. 15 

AlaValAspPheIleProVal61uAsnLeu61uThrThrMetArgSerProValPheThr 
1 GGCG6TGGACTTTATCCCTGTGGA6AACCTAGAGACAACCATGA6GTCCCCGGTGTTCAC 
CCGCCACCTGAAATAGGGACAGCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACAAGTG 

AspAsnSerSerProProValValProGlnSerPheGlnValAlaHisLeuHisAlaPro 
61 GGATAACTCCTCTCCACCAGTAGTGCCCCA6AGCTTCCAGGTGGCTCACCTCCATGCTCC 
CCTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGG 

Th rG 1 ySe rG 1 yLysSe rTh r LysVa 1 P roA 1 aA 1 aTy rA 1 aA 1 aG 1 nG 1 yTy rLy sVa 1 
121 CACAGGCAGCGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGT . 
GTGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATATTCCA 

LeuVa 1 LeuAsnProSe rVa 1 A 1 aA 1 aTh rLeuG lyPheG 1 yA 1 aTy rMetSe rLy sA 1 a 
181 GCTAGTACTCAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCAAGGC 
CGATCATGAGTTGGGGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGTTCCG 

— Overlap with 40b 

HisGlylleAspProAsnlleArgThrGlyValArgThrlleThrThrGlySerProIle 
241 TCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCAT 
AGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGGGGTA 

ThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyrAsp 
301 CACGTACTCCACGTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGA 
GTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGGGAATACT 

IlellelleCysAspGluCysHlsSerThrAspAlaThrSerlleLeuGlylleGlyThr 
361 CATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATTGGCAC 
GTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAACCGTG 

ValLeuAspGlnAlaGluThrAlaGlyAlaArgLeuValValLeuAlaThrAlaThrPro 
1*21 TGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCC 
ACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGG 

"proGlyServalThrvalProHisProA^ 
181 TCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGG 
AGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCC 

G 1 u 1 1 eProPheTy rG 1 yLysA 1 a 1 1 eProLeuG 1 uVa 1 1 1 eLysG 1 yG 1 yArgH 1 sLeu 
541 AGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCT 
TCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGA 

IlePheCysHisSerLysLysLysCysAspGluLeuAlaAlaLysLeuValAlaLeuGly 
601 CATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGG 
GTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTAACCC 

IleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVallleProThrSerGlyAsp 
661 CATGAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGA 
GTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCT 

Va 1 Va 1 Va 1 Va 1 A 1 aTh r AspA 1 aLeufletTh rG 1 yTy rTh rG 1 yAspPheAspSe rVa 1 
721 TGTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGT 
ACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCA 

IleAspCysAsnThrCys 
781 GATAGACTGCAATACGTGTG 
CTATCTGACGTTATGCACAC 



FIG. 16 

ProCysThrCysGlySerSerAspLeuTyrLeuValThrArgHisAlaAspValllePro 
1 CTCCCTGCACTT6C6GCTCCTC6GACCTTTACCTGGTCACGA6GCACGCCGATGTCATTC 
GAGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTGCTCCGTGCGGCTACAGTAAG 

ValArgArgArgGlyAspSerArgGlySerLeuLeuSerProArgProIleSerTyrLeu 
61 CCGTGCGCCGGCGGGGTGATAGCAGGGGCAGCCTGCTGTCGCCCCGGCCCATTTCCTACT 
GGCACGCGGCCGCCCCACTATCGTCCCCGTCGGACGACAGCGGGGCCGGGTAAAGGATGA 

LysGlySerSerGlyGlyProLeuLeuCysProAlaGlyHisAlaValGlyllePheArg 
121 TGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGGGCACGCCGTGGGCATATTTA 
ACTTTCCGAGGAGCCCCCCAGGCGACAACACGGGGCGCCCCGTGCGGCACCCGTATAAAT 

Overlap with 

AlaAlavalCysThrArgGlyValAlaLysAlaValAspPhelleProValGluAsnLeu 
181 GGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGGCGGTGGACTTTATCCCTGTGGAGAACC- 
CCCGGCGCCACACGTGGGCACCTCACCGATTCCGCCACCTGAAATAGGGACACCTCTTGG 

33C __ 

GluThrThrMetArgSerProValPheThrAspAsnSer 
241 TAGAGACAACCATGAGGTCCCCGGTGTTCACGGATAACTCCTC 
ATCTCTGTTGGTACTCCAGGGGCCACAAGTGCCTATTGAGGAG 



FIG. 17 

, ^Gl^TrpArgLeuLeuAlaProIleThrAlaTyrAlaGlnGlnThrArgGlyLeuLeuGly 
1 GGGGTGGAGGTTGCTGGCGCCCATCACGGCGTACGCCCAGCAGACAAGGGGCCTCCTAGG 
CCCCACCTCCAACGACCGCGGGTAGTGCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCC 

c , CvsIlelleTTirSerLeuThrGlyArgAs 

61 GTGCATAATCACCAGCCTAACTGGCCGGGACAAAAACCAAGTGGAGGGTGAGGTCCAGAT 
CACGTATTAGTGGTCGGATTGACCGGCCCTGTTTTTGGTTCACCTCCCACTCCAGGTCTA 

ValSerThrAlaAlaGlnThrPheLeuAlaThrCysI leAsnGlyValCysTrpThrVal 
121 TGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCATCAATGGGGTGTGCTGGACTGT 
ACACAGTTGACGACGGGTTTGGAAGGACCGTTGCACGTAGTTACCCCACACGACCTGACA 

TyrHisGlyAlaGlyThrArgThrlleAlaSerProLysGlyProVallleGlnMetTyr 
181 CTACCACGGGGCC6GAACGAGGACCATCGCGTCACCCAAGGGTCCTGTCATCCAGATGTA 
GATGGTGCCCCGGCCTTGCTCCTGGTAGCGCAGTGGGTTCCCAGGACAGTAGGTCTACAT 

„, , ThrAsnValAspGlnAspLeuValGlyTrpProAlaProGlnGlvSerArgSerLeuThr 
2m TACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAGGTAGCCGCTCATTGAC 
ATGGTTACATCTGGTTCTGGAACACCCGACCGGGCGAGGCGTTCCATCGGCGAGTAACTG 

Overlap with 8h 

. ProCysTh r cysG 1 ySe rSer AspLeuTy rLeuVa 1 Th rA rgH i s 
301 ACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACG 
TGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTGCTCCGTGC 



FIG. 18 



AsnMetTrpSerGlyThrPheProIleAsnAlaTyrThrThrGlyProCysThrProLeu 

1 GAACATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCTGTACCCCCCT 
CTTGTACACCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCCCGGGGACATGGGGGGA 

Overlap with 25c 

ProAlaProAsnTyrThrPheAlaLeuTrpArgValSerAlaGluGluTyrValGluIle 
61 TCCTGCGCCGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAGAGGAATACGTGGAGAT 
AGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTATGCACCTCTA 



ArgGlnValGlyAspPheHisTyrValThrGlyMetThrThrAspAsnLeuLysCysPro 
121 AAGGCAGGTGGGGGACTTCCACTACGTGACGGGTATGACTACTGACAATCTTAAATGCCC 
TTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAGAATTTACGGG 



CysGlnValProSerProGluPhePheThrGluLeuAspGlyValArgLeuHisArgPhe 
181 GTGCCAGGTCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCCTACATAGGTT 
CACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGCGGATGTATCCAA 

AlaProProCysLysProLeuLeuArgGluGluValSerPheArgValGlyLeuHisGlu 
241 TGCGCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGGACTCCACGA 
ACGCGGGGGGACGTTCGGGAACGACGCCCTCCTCCATAGTAAGTCTCATCCTGAGGTGCT 

TyrProValGlySerGlnLeuProCysGluProGluProAspValAlaValLeuThrSer 
301 ATACCCGGTAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCGTGTTGACGTC 
TATGGGCCATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCTGCACCGGCACAACTGCAG 

MetLeuThrAspProSerHisIleThrAlaGluAlaAlaGlyArgArgLeuAlaArgGly 
361 CATGCTCACTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGTTGGCGAGGGG 
GTACGAGTGACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCAACCGCTCCCC 

SerProProSerValAlaSerSerSerAlaSerGlnLeuSerAlaProSerLeuLysAla 
421 ATCACCCCCCTCTGTGGCCAGCTCCTCG6CTAGCCAGCTATCCGCTCCATCTCTCAAGGC 
TAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCG 

ThrCysThrAlaAsnHisAspSerProAsp 
481 AACTTGCACCGCTAACCATGACTCCCCTGAT 
TTGAACGTGGCGATTGGTACTGAGGGGACTA 



FIG. 19 

Overlap with 14c 

SerSerSerAlaSerGlnLeuSerAlaProSerLeuLysAlaThrCysThrAlaAspHis 
1 AGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGCAACTTGCACCGCTAACCAT 
TCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCGTTGAACGTGGCGATTGGTA 



AspSerProAspAlaGluLeuIleGluAlaAsnLeuLeuTrpArgGlnGluMetGlyGlu 
61 GACTCCCCTGATGCTGAGCTCATAGAGGCCAACCTCCTATGGAGGCAGGAGATGGGCGGC 
CTGAGGGGACTACGACTCGAGTATCTCCGGTTGGAGGATACCTCCGTCCTCTACCCGCCG 

Asn I leThrA rgVa 1 G 1 uSe rG 1 uAsnLysVa 1 Va 1 1 1 eLeuAspSe rPheAspProLeu 
121 AACATCACCAGGGTTGAGTCAGAAAACAAAGTGGTGATTCTGGACTCCTTCGATCCGCTT 
TTGTAGTGGTCCCAACTCAGTCTTTTGTTTCACCACTAAGACCTGAGGAAGCTAGGCGAA 

ValAlaGluGluAspGluArgGluI leServalProAlaGluI leLeuArgLysSerArg 
181 GTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAAATCCTGCGGAAGTCTCGG 
CACCGCCTCCTCCTGCTCGCCCTCTAGAGGCATGGGCGTCTTTAGGACGCCTTCAGAGCC 

ArgPheAlaGlnAlaLeuProValTrpAlaArgProAspTyrAsnProProLeuValGlu 
241 AGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGACTATAACCCCCCGCTAGTGGAG 
TCTAAGCGGGTCCGGGACGGGCAAACCCGCGCCGGCCTGATATTGGGGGGCGATCACCTC 

ThrTrpLysLysProAspTy rG 1 uProProVa 1 Va 1 H i sG 1 yCysProLeuProProPro 
301 ACGTGGAAAAAGCCCGACTACGAACCACCTGTGGTCCATGGCTGTCCGCTTCCACCTCCA 
TGCACCTTTTTCGGGCTGATGCTTGGTGGACACCAGGTACCGACAGGCGAAGGTGGAGGT 

LysSerProProValPro 
361 AAGTCCCCTCCTGTGCCG 
TTCAGGGGAGGACACGGC 



FIG. 20 

ValTrpAlaArgProAspTyrAsnProProLeuValGluThrTrpLysLysProAspTyr 
1 CGTTTGGGCGCGGCCGGACTATAACCCCCCGCTAGTGGAGACGTGGAAAAAACCCGACTA 
GCAAACCCGCGCCGGCCTGATATTGGGGGGCGATCACCTCTGCACCTTTTTTGGGCTGAT 

Overlap with 8f 

GluProProValValHisGlyCysProLeuProProProLysSerProProValProPro 
61 CGAACCACCTGTGGTCCATGGCTGCCCGCTTCCACCTCCAAAGTCCCCTCCTGTGCCTCC 
GCTTGGTGGACACCAGGTACCGACGGGCGAAGGTGGAGGTTTCAGGGGAGGACACGGAGG 

Pro A rgLy s Ly sA r gTh rVa 1 Va 1 LeuTh rG 1 uSe rTh r LeuSe rTh r A 1 a Le uA 1 aG 1 u 
121 GCCTCGGAAGAAGCGGACGGTGGTCCTCACTGAATCAACCCTATCTACTGCCTTGGCCGA 
CGGAGCCTTCTTCGCCTGCCACCAGGAGTGACTTAGTTGGGATAGATGACGGAACCGGCT 

LeuAlaThrAraSerPheGlySerSerSerThrSerGlylleThrGlyAspAsnThrThr 
181 GCTCGCCACCAGAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGACAATACGAC 
CGAGCGGTGGTCTTCGAAACCGTCGAGGAGTTGAAGGCCGTAATGCCCGCTGTTATGCTG 

ThrSerSe rG 1 uP roA 1 aProSerG lyCysProProAspSerAspA 1 aG 1 uSe rPhe 
241 AACATCCTCTGAGCCCGCCCCTTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCTTTGC 
TTGTAGGAGACTCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGAAACG 



# 



FIG. 21 

A laSer ArgSer PheG 1 yierierSerTh rSerGlyl leTh rG 1 yAspAsnTh rf h rThr 
1 GCCTCCAGAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGACAATACGACAACA 
CGGAGGTCTTCGAAACCGTCGAGGAGTTGAAGGCCGTAATGCCCGCTGTTATGCTGTTGT 

Overlap with 33f 

SerSerGluProAlaProSerGlyCysProProAspSerAspAlaGluSerTyrSerSer 
61 TCCTCTGAGCCCGCCCCTTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCC 
AGGAGACTCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGG 

MetProProLeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTrpSerThr 
121 ATGCCCCCCCTGGAGGGGGAGCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACG 
TACGGGGGGGACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGC 

ValSerSerGluAlaAsnAlaGluAspValValCysCysSerMetSerTyrSerTrpThr 
181 GTCAGTAGTGAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTCTTACTCTTGGACA 
CAGTCATCACTCCGGTTGCGCCTCCTACAGCACACGACGAGTTACAGAATGAGAACCTGT 

GlyAlaLeuValThrProCysAlaAlaGluGluGlnLysLeuProIleAsnAlaLeuSer 
241 GGCGCACTCGTCACCCCGTGCGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGC 
CCGCGTGAGCAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTACGTGATTCG 

AsnSerLeuLeuArgHisHisAsnLeuvalTyrSerThrThrSerArgSer 
301 AACTCGTTGCTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTG 
TTGAGCAACGATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCAC 



FIG. 22 

GlyThrTyrValTyrAsnHisLeuThrProLeuArgAspTrpAlaHisAsnGlyLeuArg 
1 GGCACCTATGTTTATAACCATCTGACTCCTCTTCGGGACTGGGCGCACAACGGCTTGCGA 
CCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTGACCCGCGTGTTGCCGAACGCT 

AspLeuAlaValAlaValGluProValValPheSerGlnMetGluThrLysLeuIleThr 
61 GATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCCCAAATGGAGACCAAGCTCATCACG 
CTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTTTACCTCTGGTTCGAGTAGTGC 

_ TrpGlyAlaAspThrAlaAlaCysGlyAspIlelleAsnGlyLeuProValSerAlaArg 
121 TGGGGGGCAGATAGCGCCGCGTGCGGTGACATCATCAAGGGCTTGCCTGTTTCCGCCCGC 
ACCCCCCGTCTATGGCGGCGCACGCCACTGTAGTAGTTGCCGAACGGACAAAGGCGGGCG 

ArgGlyArgGluIleLeuLeuGlyProAlaAspGlyMetValSerLysGlyTrpArgLeu 
181 AGGGGCCGGGAGATACTGCTCGGGCCAGCCGATGGAATGGTCTCCAAGGGTTGGAGGTTG 
TCCCCGGCCCTCTATGACGAGCCCGGTCGGCTACCTTACCAGAGGTTCCCAACCTCCAAC 

Le uA 1 a P ro 1 1 eTh r A 1 aTy rA 1 aG 1 nG 1 nTh r A r gG 1 y Le uLe uG 1 y Cy s 1 1 e I LeTh r 
241 CTGGCGCCCATCACGGCGTACGCCCAGCAGACAAGGGGCCTCCTAGGGTGCATAATCACC 
GACCGCGGGTAGTGCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCCCACGTATTAGTGG 

Overlap with 7e— — 

^ M Se r LeuTh rG 1 yArgAspLy sAsnG 1 nVal G 1 uG 1 yG 1 uVa 1 G 1 n 1 1 eVa 1 SerTh r A 1 a 
301 AGCCTAACTGGCCGGGACAAAAACCAAGTGGAGGGTGAGGTCCAGATTGTGTCAACTGCT 

TCGGATTGACCGGCCCTGTTTTTGGTTCACCTCCCACTCCAGGTCTAACACAGTTGACGA 

A 1 aG 1 nThrPheLeuA 1 aTh rCys 1 1 eAsnG 1 y Va 1 Cy sTrp 
361 GCCCAAACCTTCCTGGCAACGTGCATCAATGGGGTGTGCTGG 
CGGGTTTGGAAGGACCGTTGCACGTAGTTACCCCACACGACC 



# 



FIG. 23 

GlyGlyValValLeuValGlyLeuMetAlaLeuThrLeuSerProTyrTyrLysArgTyr 
1 GGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATATTACAAGCGCTAT 
CCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATAATGTTCGCGATA 

IleSerTrpCysLeuTrpTrpLeuGlnTyrPheLeuThrArgValGluAiaGlnLeuHis 
6 1 ATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGAAGCGCAACTGCAC 
TAGTCGACCACGAACACCACCGAAGTCATAAAAGACTGGTCTCACCTTCGCGTTGACGTG 

ValTrpIleProProLeuAsnValArgGlyGlyArgAspAlaVallleLeuLeuMetCys 
121 G TGTGG ATTCCCCCCCTCAACG TCCG AGGGGGGCGCG ACGCCGTCATCTTACTC ATG TG T 
CACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATGAGTACACA 

AlaValHisProThrLeuValPheAspIleThrLysLcuLeuLeuAlaValPheGLyPro 
181 GCTGTACACCCGACTCTGGTATTTGACATCACCAAATTGCTGCTGGCCGTCTTCGGACCC 
CGACATGTGGGCTGAGACCATAAACTGTAGTGGTTTAACGACGACCGGCAGAAGCCTGGG 

LeuTrpIleLeuGlnAlaSerLeuLeuLysValProTyrPheValArgValGlnGlyLeu 
241 CTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCGCGTCCAAGGCCTT 
GAAACCTAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGCGCAGGTTCCGGAA 

.euArgPheCysAlaLeuAlaArgLysMetlleGlyGlyHisTyrValGlnMetvallle 
3 0 1 CTCCGGTTCTGCGCGTT AGCGCGGAAGATG ATCGGAGGCCATTACGTGCAAATGGTCATC 
GAGGCCAAGACGCGCAATCGCGCCTTCTACTAGCCTCCGGTAATGCACGTTTACCAGTAG 



IleLysLeuGlyAlaLeuThrGlyThrTyrValTyrAsnHisLeuThrProLeuArgAsp 
361 ATTAAGTTAGGGGCGCTTACTGGCACCTATGTTT ATAACCATCTCACTCCTCTTCGGG AC 
TAATTCAATCCCCGCGAATGACCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTG 

Overlap with 7f 

TrpAlaHisAsnGlyLeuArgAspLeuAlaValAlaValGluProValValPheSerGln 
4 21 TGGGCGCACAACGGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCCCAA 
ACCCGCGTGTTGCCGAACGCTCTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTT 



481 



MetGluThrLysLeuIleThrTrpGly 
ATGGAGACCAAGCTCATCACGTGGGGGGC 
TACCTf TGGTTCGAGTAGTGCACCCCCCG 



0 



FIG. 24 

G 1 uTy rVa 1 Va 1 LeuLeuPheLeuLeuLeuA 1 aAspA 1 aA rgVa 1 CysSerCysLeuTrp 
1 6GGAGTACGTCGTTCTCCT6TTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGT 
CCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGACGAACA 

MetMetLeuLeuIleSerGlnAlaGluAlaAlaLeuGluAsnLeuVallleLeuAsnAla 
61 GGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGAACCTCGTAATACTTAATG 
CCTACTACGATGAGTATAGGGTTCGCCTCCGCCGAAACCTCTTGGAGCATTATGAATTAC 

, „, , . 1 § s § £ LeuA 1 aG 1 yTh rH i sG 1 yLeuVa 1 Se rPheLeuVa 1 PhePheCy sPheA 1 aTrp 
121 CAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTTTGCAT 
GTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGCACAAGAAGACGAAACGTA 

TyrLeuLysG 1 yLysTrpVa 1 ProG lyA 1 aVa UyrThrPheTyrGl yMetTrpProLeu 
181 GGTATTTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCTTCTACGGGATGTGGCCTC 
CCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGAAGATGCCCTACACCGGAG 

ni , ^LeuLeuLeuLeuLeuAlaLeuProGlnArgAlaTyrAlaLeuAspThrGluValAlaAla 
241 TCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGTGGCCG 
AGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCGACCTGTGCCTCCACCGGC 

Overlap with lib 

™, ^ SerCysGlyGlyValValLeuValGlyLeuMetAlaLeuThrLeuSerProTyrTyrLys 
301 CGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATATTACA 
GCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATAATGT 

ArgTyrl leSerTrpCysLeuTrpTrpLeuGln 
361 AGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGAA 
TCGCGATATAGTCGACCACGAACACCACCGAAGTCTT 

FIG. 25 

ProAlaProSerGlyCysProProAspSerAspAlaGluSerTyrSerSerMetProPro 
1 CCAGCCCCTTeTGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCCCC 
GGTCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGGGG 

^ LeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTrpSerThrValSerSer 
61 CTGGAGGGGGAGCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACAGTCAGTAGT 
GACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGTCAGTCATCA 

Overlap with 33g ■ 

GluAlaAsnAlaGluAspValValCysCysSerMetSerTyrSerTrpThrGlyAlaLeu 
121 GAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTCCTACTCTTGGACAGGCGCACTC 
CTCCGGTTGCGCCTCCTACAGCACACGACGAGTTACAGGATGAGAACCTGTCCGCGTGAG 

ValThrProCysAlaAlaGluGluGlnLysLeuProIleAsnAlaLeuSerAsnSerLeu 
181 GTCACCCCGTGCGCCGCGGAAGAACAGAAACT6CCCATCAATGCACTGAGCAACTCGTTG 
CAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTACGTGACTCGTTGAGCAAC 

OI , Ley^C2lii§Hl§AsnLeuValTyrSerThrThrSerArgSerAlaCysGlnArgGlnLys 
241 CTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAGAAG 
GATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCTTC 

_ Lys Va ITh rPheAspA rgLeuG 1 nVa 1 LeuAspSerH i sTy rG 1 nAspVa 1 LeuLysG 1 y 
301 AAAGTCACATTTGACAGACTGCAAGTTCTGGACAGCCATTACCAGGACGTACTCAAGGAG 
TTTCAGTGTAAACTGTCTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATGAGTTCCTC 

ValLysAlaAlaAlaSerLysValLysAlaAsnPhe 
361 GTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTC 
CAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAAG 



# 



FIG. 26A 

G 1 uTy rVa 1 Va 1 LeuLeuPheLeuLeuLeuA la AspA 1 a ArgVa 1 CysSe rCys LeuTrp 
1 GGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGT 
CCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGACGAACA 

MetMetLeuLeuIleSerGlnAlaGluAlaAlaLeuGluAsnLeuVallleLeuAsnAla 
61 GGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGAACCTCGTAATACTTAATG 
CCTACTACGATGAGTATAGGGTTCGCCTCCGCCGAAACCTCTTGGAGCATTATGAATTAC 

AlaSerLeuAlaGlyThrHisGlyLeuValSerPheLeuValPhePheCysPheAlaTrp 
121 CAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTTTGCAT 
GTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGCACAAGAAGACGAAACGTA 

TyrLeuLysGlyLysTrpValProGlyAlaValTyrThrPheTyrGlyMetTrpProLeu 
181 GGTATTTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCTTCTACGGGATGTGGCCTC 
CCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGAAGATGCCCTACACCGGAG 

LeuLeuLeuLeuLeuAlaLeuProGlnArgAlaTyrAlaLeuAspThrGluValAlaAla 
241 TCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGTGGCCG 
AGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCGACCTGTGCCTCCACCGGC 

Se rCysG 1 yG Ly Va 1 Va 1 LeuVa 1 G 1 yLeuMetA 1 aLeuTh rLeuSe rProTy rTy rLys 
301 CGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATATTACA 
GCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATAATGT 

ArgTyrlleSerTrpCysLeuTrpTrpLeuGlnTyrPheLeuThrArgValGluAlaGln 
361 AGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGAAGCGG 
TCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAGACTGGTCTCACCTTCGCG 

LeuHisValTrpIleProProLeuAsnValArgGlyGlyArgAspAlaVallleLeuLeu 
421 AACTGCACGTGTGGATTCCCCCCCTCAAEGTCCGAGGGGGGCGCGACGCCGTCATCTTAC 
TTGACGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATG 

MeteysA 1 aVa \H I sProTh rLeuVa 1 PheAsp 1 1 eTh r LysLeuLeuLeuA laVa 1 Phe 
481 TCATGTGTGCTGTACACCCGACTCTGGTATTTGACATCACCAAATTGCTGCTGGCCGTCT 
AGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGTTTAACGACGACCGGCAGA 

G 1 y ProLeuTrp 1 1 e LeuG 1 n A 1 aSe rLeuLeuLys Va 1 P roTy rPheVa 1 A rgVa 1 G 1 n 
541 TCGGACCCCTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCGCGTCC 
AGCCTGGGGAAACCTAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGCGCAGG 

G 1 y LeuLeuA rgPheCys A 1 aLeuA i aA rgLysMet I leGlyGlyHi sTy rVa 1 G 1 nMe t 
601 AAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGTGCAAA 
TTCCGGAAGAGGCCAAGACGCGCAATCGCGCCTTCTACTAGCCTCCGGTAATGCACGTTT 

Va 1 1 1 e 1 1 eLys LeuG 1 y A 1 a Le uTh rG 1 yTh rTy rVa 1 Ty r As nH i s LeuTh rP ro Leu 
661 TGGTCATCATTAAGTTAGGGGCGCTTACTGGCACCTATGTTTATAACCATCTCACTCCTC 
ACCAGTAGTAATTCAATCCCCGCGAATGACCGTGGATACAAATATTGGTAGAGTGAGGAG 

ArgAspTrpA laH i sAsnG 1 yLeuA rgAspLeuA 1 aVa 1 A 1 aVa 1 G 1 uProVa 1 Va 1 Phe 
721 TTCGGGACTGGGCGCACAACGGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCT 
AAGCCCTGACCCGCGTGTTGCCGAACGCTCTAGACCGGCACCGACATCTCGGTCAGCAGA 

Se rG 1 nMe tG 1 uTh rLysLeu II eTh rTrpG 1 yA 1 a AspTh rA 1 aA 1 aCy sG 1 yAsp I le 
781 TCTCCCAAATGGAGACCAAGCTCATCACGTGGGGGGGAGATACCGCCGCGTGCGGTGACA 
AGAGGGTTTACCTCTGGTTCGAGTAGTGCACCCCCCGTCTATGGCGGCGCACGCCACTGT 

1 1 eAsnG 1 yLeuP roVa 1 Se r A 1 aA rgA rgG 1 yA rgG 1 u 1 1 eLeuLeuG 1 yP roA 1 a Asp 
841 TCATCAACGGCTTGCCTGTTTCCGCCCGlAGGGGCCGGGAGATACTGCTCGGGCCAGCCG 
AGTAGTTGCCGAACGGACAAAGGCGGGCGTCCCCGGCCCTCTATGACGAGCCCGGTCGGC 

GlyMetValSerLysGlyTrpArgLeuLeuAlaProIleThrAlaTyrAlaGlnGlnThr 
901 ATGGAATGGTCTCCAAGGGGTGGAGGTTGCTGGCGCCCATCACGGCGTACGCCCAGCAGA 
TACCTTACCAGAGGTTCCCCACCTCCAACGACCGCGGGTAGTGCCGCATGCGGGTCGTCT 
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Arg61yLeuLeuGlyCysIleIleThrSerLeuThrGlyArgAspLysAsn&lnVal61u 
961 CAAGGGGCCTCCTAGGGTGCATAATCACCAGCCTAACTGGCCGGGACAAAAACCAAGTGG 
GTTCCCCGGAGGATCCCACGTATTAGTGGTCGGATTGACCGGCCCTGTTTTTGGTTCACC 

G 1 y_G 1 uVa 1 G 1 n 1 1 eVa 1 SerThr A 1 aA 1 aG 1 nTh rPheLeu A 1 aTh rCy s 1 1 eAsnG 1 y 
1021 AGGGTGAGGTCCAGATTGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCATCAATG 
TCCCACTCCAGGTCTAACACAGTTGACGACGGGTTTGGAAGGACCGTTGCACGTAGTTAC 

ValCysTrpThrValTyrHisGlyAlaGlyThrArgThrl leAlaSerProLysGlyPro 
1081 GGGTGTGCTGGACTGTCTACCACGGGGCCGGAACGAGGAGCATCGCGTCACCCAAGGGTC 
CCCACACGACCTGACAGATGGTGCCCCGGCCTTGCTCCTGGTAGCGCAGTGGGTTCCCAG 

Val I leGl nMe tTy rThrAsnVa 1 AspG 1 nAspLeuVa 1 G 1 yTrpProA 1 aP roG 1 nG 1 y 
1141 CTGTCATCCAGATGTATACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAG 
GACAGTAGGTCTACATATGGTTACATCTGGTTCTGGAACACCCGACCGGGCGAGGCGTTC 

SerA rgSe rLeuThrProCysTh r CysG 1 ySerSe rAspLeuTy rLeuVa 1 Th rA rgH i s 
1201 GTAGCCGCTCATTGACACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCACGAGGC 
CATCGGCGAGTAACTGTGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTGCTCCG 

AlaAspVallleProValArgArgArgGlyAspSerArgGlySerLeuLeuSerProArg 
1261 ACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGGGCAGCCTGCTGTCGCCCC 
TGCGGCTACAGTAAGGGCACGCGGCCGCCCCACTATCGTCCCCGTCGGACGACAGCGGGG 

ProIleSerTyrLeuLysGlySerSerGlyGlyProLeuLeuCysProAlaGlyHisAla 
1321 GGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGGGCACG 
CCGGGTAAAGGATGAACTTTCCGAGGAGCCCCCCAGGCGACAACACGGGGCGCCCCGTGC 

ValGlyHePheArgAlaAlaValCysThrArgGlyValAlaLysAlaValAspPhelle 
1381 CCGTGGGCATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGGCGGTGGACTTTA 
GGCACCCGTATAAATCCCGGCGCGACACGTGGGCACCTCACCGATTCCGCCACCTGAAAT 

ProValGluAsnLeuGluThrThrMetArgSerProValPheThrAspAsnSerSerPro 
1141 TCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGTTCACGGATAACTCCTCTC 
AGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACAAGTGCCTATTGAGGAGAG 

ProVaT Va 1 ProG 1 nSerPheG 1 nVa 1 A 1 aH i sLeuH i sA laProThrG lySerG lyLys 
1501 CACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATGCTCCCACAGGCAGCGGCA 
GTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGGGTGTCCGTCGCCGT 

SerThrLysValProAlaAlaTyrAlaAlaGlnGlyTyrLysValLeuValLeuAsnPro 
1561 AAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTGCTAGTACTCAACC 
TTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATATTCCACGATCATGAGTTGG 

SerValAlaAlaThrLeuGlyPheGlyAlaTyrMetSerLysAlaHisGlylleAspPro 
1621 CCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCAAGGCTCATGGGATCGATC 
GGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGTTCCGAGTACCCTAGCTAG 

Asn 1 1 eA rgTh rG 1 y Va 1 A rgTh r 1 1 eTh rTh rG 1 ySe r P ro 1 1 eTh rTy rSe rTh rTy r 
1681 CTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTCCACCT 
GATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGGGGTAGTGCATGAGGTGGA 

GlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyrAspI lei lei leCysAsp 
1741 ACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGACATAATAATTTGTG 
TGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAATACTGTATTATTAAACAC 

GluCysHisSerThrAspAlaThrSerlleLeuGlylleGlyThrValLeuAspGlnAla 
1801 ACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATCGGCACTGTCCTTGACCAAG 
TGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAGCCGTGACAGGAACTGGTTC 

G 1 uThrA 1 aG 1 yA 1 aA r gLeuVa 1 Va 1 LeuA 1 aTh r A 1 aThr ProP roG 1 ySe rVa 1 Th r 
1861 CAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCCTCCGGGCTCCGTCA 
GTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGT 

Va 1 ProH 1 s ProAsn 1 1 eG 1 uG 1 uVa 1 A 1 aLeuSe rTh rTh rG 1 yG 1 u 1 1 eP roPheTy r 
1921 CTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCAC^ 

GACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCCTCTCTAGGGAAAAA 
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GlyLysAlalleProLeuGluVallleLysGlyGlyArgHisLeuIlePheCysHisSer 
1981 ACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTCTGTCATT 
TGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGAGTAGAAGACAGTAA 

LysLysLysCysAspG luLeuAlaAlaLysLeuVal A laLeuGly I leAsnAlaVal A la 
2041 CAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGGCATCAATGCCGTGG 
GTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTAACCCGTAGTTACGGCACC 

TyrTyrArgGlyLeuAspValSerVallleProThrSerGlyAspValvalValvalAla 
2101 CCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGATGTTGTCGTCGTGG 
GGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCTACAACAGCAGCACC 

Th r AspA 1 aLeuMetThr G 1 yTy rTh rG 1 yAspPhe AspSe rVa 1 1 1 eA spCys A snTh r 
2161 CAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGTGATAGACTGCAATA 
GTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCACTATCTGACGTTAT 

CysValThrGlnThrValAspPheSerLeuAspProThrPheThrlleGluThrlleThr 
2221 CGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCTTCACCATTGAGACAATCA 
GCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTGTTAGT 

LeuProG 1 nAspA 1 aVa 1 Se r A rgTh rG 1 nA rgA rgG 1 y A rgTh r G 1 yA r gG 1 y Lys P ro 
2281 CGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGGACTGGCAGGGGGAAGC 
GCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCCTGACCGTCCCCCTTCG 

GlylleTyrArgPheValAlaProGlyGluArgProSerGlyMetPheAspSerServal 
2341 CAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGCATGTTCGACTCGTCCG 
GTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCGTACAAGCTGAGCAGGC 

LeuCysG 1 uCysTy r AspA 1 aG 1 yCysA 1 aTrpTy rG luLeuTh rProA 1 aG 1 uThrThr 
2401 TCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTCACGCCCGCCGAGACTA 
AGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAGTGCGGGCGGCTCTGAT 

ValArgLeuArgAlaTyrMetAsnThrProGlyLeuProValCysGlnAspHisLeuGlu 
2461 CAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTGTGCCAGGACCATCTTG 
GTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGCACACGGTCCTGGTAGAAC 

PheTrpGluGlyValPheThrGlyLeuThrHislleAspAlaHisPheLeuSerGlnThr 
2521 AATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCCCACTTTCTATCCCAGA 
TTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGGGTGAAAGATAGGGTCT 

LysG 1 nSe rG 1 yG 1 uAsnLeuProTy rLeuVa 1 A 1 aTy rG 1 n A 1 aTh rVa 1 CysA 1 aA rg 
2581 CAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTGCGCTA 
GTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTTCGGTGGCACACGCGAT 

A 1 aG 1 nA 1 aProP roP rose rTrpAspG 1 nfletTrpLysCysLeu 1 1 eA rgLeuLysP ro 
2641 GGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCGCCTCAAGC 
CCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACAAACTAAGCGGAGTTCG 

ThrLeuHisGlyProThrProLeuLeuTyrArgLeuGlyAlaValGlnAsnGluI leThr 
2701 CCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTTCAGAATGAAATCA 
GGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGACAAGTCTTACTTTAGT 

LeuThrH i sProVal ThrLysTy r 1 1 eMetTh rCysMetSer A 1 aAspLeuG 1 uVa 1 Va 1 
2761 CCCTGACGCACCCAGTCAGCAAATACATCATGACATGCATGTCGGCCGACCTGGAGGTCG 
GGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGCCGGCTGGACCTCCAGC 

ThrSerThrTrpValLeuValGly61yValLeuAlaAlaLeuAlaAlaTyrCysLeuSer 
2821 TCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTGCCTGT 
AGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAACCGGCGCATAACGGACA 

ThrGlyCysValVal I leValGlyArgValValLeuSerGlyLysProAlal lei lePro 
2881 CAACAGGGTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTGCGGGAAGCCGGCAATCATAC 
GTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTAGTATG 

AspArgG 1 uVa 1 LeuTy r ArgG 1 uPheAspG 1 uMetG 1 uG 1 uCysSe rGl nH 1 sLeuPro 

2941 ctgacagggaagtcctctaccgSgagttcgatgagatggaagagtgctctcagcacttac 
gactgtcccttcaggagatggctctcaagctactctaccttgtcacgagagtcgtgaatg 
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FIG. 26D 

TyrlleGluGlnGl yMe tMe tLeuA 1 aG 1 uG 1 nPheLysG 1 nLy sA 1 aLeuG lyLeuLeu 
3001 CGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCC 
GCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGG 

GlnThrAlaSerArgGlnAlaGluVallleAlaProAlaValGlnThrAsnTrpGlnLys 
3061 TGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTGGCAAA 
ACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACCGTTT 

LeuGluThrPheTrpAlaLysHisMetTrpAsnPhelleSerGlylleGlnTyrLeuAla 
3121 AACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATACTTGG 
TTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATGAACC 

GlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeuMetAlaPheThrAlaAla 
3181 CG6GCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCT.TCATTGATGGCTTTTACAGCTG 
GCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGTCGAC 

ValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsnlleLeuGlyGlyTrpVal 
3241 CTGTCACCAGGCCACTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGGGTGGG 
GACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATAACCCCCCCACCC 

AlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheValGlyAlaGlyLeuAlaGly 
3301 TGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTTAGCTG 
ACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAATCGAC 

AlaAlalleGlySerValGlyLeuGlyLysValLeuIleAspIleLeuAlaGlyTyrGly 
3361 GCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGGGTATG 
CGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGGAACGTCCCATAC 

A 1 aG 1 y Va 1 A 1 aG 1 yA 1 aLeuVa 1 A 1 aPheLys 1 1 eMetSe rG 1 yG 1 uVa 1 P roSe rTh r 
3421 GCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCCCTCCA 
CGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCACTCCAGGGGAGGT 

GluAspLeuValAsnLeuLeuProAlalleLeuSerProGlyAlaLeuValValGlyVal 
3481 CGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCCTCGTAGTCGGCG 
GCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGGAGCATCAGCCGC 

ValCysAlaAlalleLeuArgArgHisValGlyProGlyGluGlyAlaValGlnTrpMet 
3541 TGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGGCAGTGCAGTGGA 
ACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCCGTCACGTCACCT 

AsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSerProThrHisTyrValPro 
3601 TGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTACGTGC 
ACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGATGCACG 

GluSerAspAlaAlaAlaArgValThrAlalleLeuSerSerLeuThrValThrGlnLeu 
3661 CGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTGTAACCCAGC 
GCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGTCGGAGTGACATTGGGTCG 

LeuA rgA rgLeuH i sG 1 nTrp 1 1 eSe rSe rG 1 uCy sTh rTh rProCy sSe rG 1 ySe rTrp 
3721 TCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCACTCCATGCTCCGGTTCCT 
AGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCCAAGGA 

LeuArgAspI leTrpAspTrpI leCysGluValLeuSerAspPheLysThrTrpLeuLys 
3781 GGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCGACTTTAAGACCTGGCTAA 
CCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGACCGATT 

AlaLysLeuMetProGlnLeuProGlylleProPheValSerCysGlnArgGlyTyrLys 
3841 AAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCTTTGTGTCCTGCCAGCGCGGGTATA 
TTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACAGGACGGTCGCGCCCATAT 

G 1 y Va 1 TrpA rgVa 1 A spG 1 y 1 1 eMe tH i sTh rA rgCysH i sCysG 1 yA 1 aG 1 u 1 1 eThr 
3901 AGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCGCTGCCACTGTGGAGCTGAGATCA 
TCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCGACGGTGACACCTCGACTCTAGT 

GlyHisVal LysAsnG lyTh rMet Arg 1 1 eVa 1 G 1 yProA rgTh rCysA rgAsnMetTrp 
3961 CTGGACATGTCAAAAACGGGACGATGAGGATCGTCGGTCCTAGGACCTGCAGGAACATGT 
GACCTGTACAGTTTTTGCCCTGCTACTCCTAGCAGCCAGGATCCTGGACGTCCTTGTACA 
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FIG. 26E 

^^SerGlyThr^eProIleAsnAlaTyrThrThrGlyProCysThrProLeuProAlaPro 
0021 GGAGTGGGACCTTCCCCATTAATGCCTACACCAC6GGCCCCTGTACCCCCCTTCCTGCGC 
CCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCCCGGGGACATGGGGGGAAGGACGCG 

, no i ^^^f PJYtT!3j:PheAlaLeuTrpArgVa 1 SerA 1 aG 1 uG 1 uTy rVa 1 G 1 u 1 1 eA rgG 1 nVa 1 
4081 CGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAGAGGAATATGTGGAGATAAGGCAGG 
GCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTATACACCTCTATTCCGTCC 

, , GlyAspPheHisTyrValThrGlyMetThrThrAspAsnLeuLysCysProCysGlnVal 
4141 TGGGGGACTTCCACTACGTGACGGGTATGACTACTGACAATCTCAAATGCCCGTGCCAGG 
ACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAGAGTTTACGGGCACGGTCC 

ProSerProGluPhePheThrGluLeuAspGlyValArgLeuHisArgPheAlaProPro 
4201 JCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCCTACATAGGTTTGCGCCCC 
AGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGCGGATGTATCCAAACGCGGGG 

. Cys Ly sProLeuLeuA rgG luG 1 uVa 1 Se rPheA rgVa 1 G 1 y LeuH i sG 1 uTyr ProVa 1 
4261 CCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGGACTCCAC 

GGACGTTCGGGAACGACGCCCTCCTCCATAGTAAGTCTCATCCTGAGGTGCTTATGGGCC 

, GlySerGlnLeuProCysGluProGluProAspValAlaValLeuThrSerMetLeuThr 
4321 TAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCGTGTTGACGTCCATGCTCA 
ATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCTGCACCGGCACAACTGCAGGTACGAGT 

,,o, r AsDPrgSerHisIleThrAlaGluAlaAlaGlyArgArgLeuAlaArgGLySerProPro 
4381 CTGATCCqCCCATATAACAGCAGM 

GACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCAACCGCTCCCCTAGTGGGG 

, , , , ^ SerValAlaSerSerSerAlaSerGlnLeuSerAlaProSerLeuLysAlaThrCysThr 
4441 CCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGCAACTTGCA 
GGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCGTTGAACGT 

, ^AlaAsnHisAspSerProAspAlaGluLeuI leGluAlaAsnLeuLeuTrpArgGlnGlu 
4501 CCGCTAACCATGACTCCCCTGATGCTGAGCTCATAGAGGCCAACCTCCTATGGAGGCAGG 
GGCGATTGGTACTGAGGGGACTACGACTCGAGTATCTCCGGTTGGAGGATACCTCCGTCC 

,.ir C , « ^MetGlyGlyAsn lie Thr Argva 1G 1 uSerG 1 uAsnLysVa lVal 1 1 eLeuAspSer Phe 
4561 AGATGGGCGGCAACATCACCAGGGTTGAGTCAGAAAACAAAGTGGTGATTCTGGACTCCT 
TCTACCCGCCGTTGTAGTGGTCCCAACTCAGTCTTTTGTTTCACCACTAAGACCTGAGGA 

i,coi x AspProLeuValAlaGluG 
4621 TC6ATCCGCTTOTGGCGGAGGAGW 

AGCTAGGCGAACACCGCCTCCTCCTGCTCGCCCTCTAGAGGCATGGGCGTCTTTAGGACG 

^^L^§^C9 A C9£ heAlaGlnAlaLeuPr o v alTrpAlaArgProAspTyrAsnProPro 
4681 GGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGACTATAACCCCC 
CCTTCAGAGCCTCTAAGCGGGTCCGGGACGGGCAAACCCGCGCCGGCCTGATATTGGGGG 

„-,, i ^ r L?yyalGiqh^rpLysLysProAspTyrGluProP 
4741 CGCTAGTGGAGACGTGGAAAAAGCCCGACT^ 

GCGATCACCTCTGCACCTTTTTCGGGCTGATGCTTGGTGGACACCAGGTACCGACAGGCG 

ProProProLysSerProProValProProProArgLysLysArgThrValValLeuThr 
4801 TTCCACCTCCA^CTCCCCTCCTGTGCCTCCGCCTCGGAAGAAGCGGACGGTGGTCCTCA 
AAGGTGGAGGTTTCAGGGGAGGACACGGAGGCGGAGCCTTCTTCGCCTGCCACCAGGAGT 

(lOC1 r _quSerThrLeuSerThr^ 
4861 qGAATCAACCCTATCTACTGtt^ 

GACTTAGTTGGGATAGATGACGGAACCGGCTCGAGCGGTGGTCTTCGAAACCGTCGAGGA 

/,oo , r a TD-CSerGly 1 1 eThrG lyAspAsnTh rTh rThrSe rSe rG 1 uProA 1 aProSerG 1 yCys 
4921 CAACTTCCGGCATTACGGGCGACAATACGACAACATCCTCTGAGCCCGCCCCTTCTGGCT 
GTTGAAGGCCGTAATGCCCGCTGTTATGCTGTTGTAGGAGACTCGGGCGGGGAAGACCGA 

(lQO . r r PfoProAspSe rAspA 1 aG 1 uSe rTy rSe rSe rMe tProProLeuG 1 uG 1 yG 1 uP roG 1 y 
z * 981 ^QQQpCCCGAqCCGACGCTGACTC^ 

CGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGGGGGACCTCCCCCTCGGAC 



) 



FIG. 26F 

AspProAspLeuSerAspGlySerTrpSerThrValSerSerGluAlaAsnAlaGluAsp 
5041 GGGATCCGGATCTTAGCGACGGGTCATGGTCAACGGTCAGTAGTGAGGCCAACGCGGAGG 
CCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGCCAGTCATCACTCCGGTTGCGCCTCC 

ValValCysCysSerMetSerTyrSerTrpThrGlyAlaLeuValThrProCysAlaAla 
5101 ATGTCGTGTGCTGCTCAATGTCTTACTCTTGGACAGGCGCACTCGTCACCCCGTGCGCCG 
TACAGCACACGACGAGTTACAGAATGAGAACCTGTCCGCGTGAGCAGTGGGGCACGCGGC 

GluGluGlnLysLeuProIleAsnAlaLeuSerAsnSerLeuLeuArgHisHisAsnLeu 
5161 CGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGTTGCTACGTCACCACAATT 
GCCTTCTT6TCTTTGACGGGTAGTTACGTGATTCGTTGAGCAACGATGCAGTGGTGTTAA 

Va lTy rSe rThrThrSerA rgSerA 1 aCysG 1 nA rgG InLysLysVa lthrPheAspA rg 
5221 TGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAGAAGAAAGTCACATTTGACA 
ACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCTTCTTTCAGTGTAAACTGT 

LeuGlnValLeuAspSerHisTyrGlnAspValLeuLysGluValLysAlaAlaAlaSer 
5281 GACTGCAAGTTCTGGACAGCCATTACCAGGACGTACTCAAGGAGGTTAAAGCAGCGGCGT 
CTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATGAGTTCCTCCAATTTCGTCGCCGCA 

LysValLysAlaAsnLeu 
5341 CAAAAGTGAAGGCTAACTTG 
GTTTTCACTTCCGATTGAAC 



FIG. 30 



G 1 yG 1 yG 1 u AsnCysG 1 yTy r A rgA rgCysA rgA 1 aSe rG 1 y Va 1 LeuThrThrSerCys 
1 GGGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGCAAGCGGCGTACTGACAACTAGCTGT 
CCCCCCCTCTTGACGCCGATAGCGTCCACGGCGCGTTCGCCGCATGACTGTTGATCGACA 



GlyAsnThrLeuThrCysTyrl leLysAlaArgAlaAlaCysArgAlaAlaGlyLeuG In 
GGTAACACCCTCACTTGTTACATCAAGGCCCGAGCAGCCTGTCGAGCCGCAGGGCTCCAG 
CCATTGTGGGAGTGAACAATGTAGTTCCGGGCTCGTCGGACAGCTCGGCGTCCCGAGGTC 

— _ Overlap with 19g~ 

AspCy sThrMetLeuVa 1 CysG 1 yAspAspLeuVa 1 Va 1 1 1 eCysG 1 uSerA 1 aG ly Va 1 
GACTGCACCATGCTCGTGTGTGGCGACGACTTAGTCGTTATCTGTGAAAGCGCGGGGGTC 
CTGACGTGGTACGAGCACACACCGCTGCTGAATCAGCAATAGACACTTTCGCGCCCCCAG 



GlnGluAspAlaAlaSerLeuArgAlaPheThrGluAlaMetThrArgTyrSerAlaPro 
181 CAGGAGGACGCGGCGAGCCTGAGAGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCCC 
GTCCTCCTGCGCCGCTCGGACTCTCGGAAGTGCCTCCGATACTGGTCCATGAGGCGGGGG 

ProGlyAspProProGlnProGluTyrAspLeuGluLeuIleThrSerCysSerSerAsn 
241 CCTGGGGACCCCCCACAACCAGAATACGACTTGGAGCTCATAACATCATGCTCCTCCAAC 
GGACCCCTGGGGGGTGTTGGTCTTATGCTGAACCTCGAGTATTGTAGTACGAGGAGGTTG 

Va lSe rVa 1 A 1 aH i sAspG lyAlaGl yLysA rgVa 1 Ty rTy r LeuThrA rgAspP roThr 
301 GTGTCAGTCGCCCACGACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACA 
CACAGTCAGCGGGTGCTGCCGCGACCTTTCTCCCAGATGATGGAGTGGGCACTGGGATGT 

ThrProLeuAlaArgAlaAlaTrpGluThrAlaArgHisThrProValAsnSerTrpLeu 
361 ACCCCCCTCGCGAGAGCTGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTA 
TGGGGGGAGCGCTCTCGACGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGACCGAT 

GlyAsnl lei leMetPheAlaProThrLeuTrpAla 
421 GGCAACATAATCATGTTTGCCCCCACACTGTGGGCG 
CCGTTGTATTAGTACAAACGGGGGTGTGACACCCGC 
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FIG. 28 

Overlap with 39c 

LeaLysGluValLysAlaAlaAlaSerLysValLysAlaAsnLeuLeuSerValGluGlu 
1 TGCTCAAGGAGGTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGG 
ACGAGTTCCTCCAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAACGATAGGCATCTCC 

AlaCysSerLeuThrProProHisSerAlaLysSerLysPheGlyTyrGlyAlaLysAsp 
6 1 AAGCTTGCAGCCTGACGCCCCCACACTCAGCCAAATCCAAGTTTGGTTATGGGGCAAAAG 
TTCGAACGTCGGACTGCGGGGGTGTGAGTCGGTTTAGGTTCAAACCAATACCCCGTTTTC 

ValArgCysHisAlaArgLysAlaValThrHisIleAsnSerValTrpLysAspLeuLeu 
121 ACGTCCGTTGCCATGCCAGAAAGGCCGTAACCCACATCAACTCCGTGTGGAAAG ACCTTC 
TGCAGGCAACGGTACGGTCTTTCCGGCATTGGGTGTAGTTGAGGCACACCTTTCTGGAAG 

GluAspAsnValThrProIleAspThrThrlleMetAiaLysAsnGluValPheCysVal 
181 TGG AAG ACAATGTAAC ACCAATAGACACTACCATCATGGCTAAGAACGAGGTTTTCTGCG 
ACCTTC TGTT AC ATTGTGGTTATCTGTGATGGTAGTACCGATTCTTGCTCCAAAAGACGC 

GlnProGluLysGlyGlyArgLysProAlaArgLeuIleValPheProAspLeuGlyVal 
24 1 TTCAGCCTG AGAAGGGGGGTCGTAAGCCAGCTCGTCTCATCGTGTTCCCCG ATCTGGGCG 
AAGTCGGACTCTTCCCCCCAGCATTCGGTCGAGCAGAGTAGCACAAGGGGCTAGACCCGC 

ArgValCysGluLysMetAlaLauTyrAspValValThrLysLeuProLeuAlaValMet 
301 TGCGCGTGTGCGAAAAGATGGCTTTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTG A 
ACGCGCACACGCTTTTCTACCGAAACATGCTGCACCAATGTTTCGAGGGGAACCGGCACT 

GlySerSerTyrGlyPheGlnTyrSerProGlyGlnArgValGluPheLeuValGlnAla 
361 TGGGAAGCTCCTACGG ATTCCAATACTCACCAGGACAGCGGGTTG AATTCCTCGTGCAAG 
ACCCTTCGAGGATGCCTAAGGTTATGAGTGGTCCTGTCGCCCAACTTAAGGAGCACGTTC 

TrpLysSerLysLysThrProMetGlyPheSerTyrAspThrArgCysPheAspSerThr 
421 CGTGGAAGTCCAAG AAAACCCCAATGGGGTTCTCGTATGATACCCGCTGCTTTG ACTCCA 
GCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTATGGGCGACGAAACTGAGGT 

ValThrGluSerAspIleArgThrGluGluAla 
4 81 CAGTCACTGAGAGCGACATCCGTACGGAGGAGGCA 
GTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGT 



FIG. 29 



GluPheLeuValGlnAlaTrpLysSerLysLysThrProMetGlyPheSerTyrAspThr 
1 GAATTCCTCGTGCAAGCGTGGAAGTCCAAGAAAACCCCAATGGGGTTCTCGTATGATACC 
CTTAAGGAGCACGTTCGCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTATGC 

Overlap with 35f 

ArgCysPheAspSerThrValThrGluSerAspIleArgThrGluGluAlalleTyrGln 
61 CGCTGCTTTGACTCCACAGTCACTGAGAGCGACATCCGTACGGAGGAGGCAATCTACCAA 
GCGACGAAACTGAGGTGTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGTTAGATGGTT 

CysCysAspLeuAispProGlnAlaArgValAlalleLysSerLeuThrGluArgLeuTyr 
121 TGTTGTGACCTCGACCCCCAAGCCCGCGTGGCCATCAAGTCCCTCACCGAGAGGCTTTAT 
ACAACACTGGAGCTGGGGGTTCGGGCGCACCGGTAGTTCAGGGAGTGGCTCTCCGAAATA 

ValGlyGlyProLeuThrAsnSerArgGlyGluAsnCysGlyTyrArgArgCysArgAla 
181 GTTGGGGGCCCTCTTACCAATTCAAGGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGCG 
CAACCCCCGGGAGAATGGTTAAGTTCCCCCCTCTTGACGCCGATAGCGTCCACGGCGCGC 

SerGlyValLeuThrThrSerCysGlyAsnThrLeuThrCysTyrlleLysAlaArgAla 
241 AGCGGCGTACTG AC AACTAGCTGTGGTAAC ACCCTCACTTGCTACATCAAGGCCCGGGC A 
TCGCCGCATGACTGTTGATCGACACCATTGTGGGAGTGAACGATGTAGTTCCGGGCCCGT 

AlaCysArgAlaAlaGlyLeuGlnAspCysThrMetLeuValCysGlyAspAspLeuVal 
301 GCCTGTCGAGCCGCAGGGCTCC AGG ACTGCACCATGCTCGTGTGTGGCGACGACTTAGTC 
CGGACAGCTCGGCGTCCCG AGG TCCTG ACGTGGTACG AGC ACACACCGCTGCTG AATCAG 

VallleCysGluSerAlaGlyValGlnGluAspAlaAla 
3 6 1 GTTATCTGTGAAAGCGCGGGGG TCCAGG AGGACGCGGCGAG 
CAATAGACACTTTCGCGCCCCCAGGTCCTCCTGCGCCGCTC 
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FIG. 32A 

, ^IlePheLysIleArgMetTyrValGlyGlyValGluHisArgLeuGluAlaAlaCysAsn 
1 CCATATTTAAAATCAGGATGTACGTGGGAGGGGTCGAACACAGGCTGGAAGCTGCCTGCA 
GGTATAAATTTTAGTCCTACATGCACCCTCCCCAGCTTGTGTCCGACCTTCGACGGACGT 

TrpThrArgGlyGluArgCysAspLeuGluAspArgAspArgSerGluLeuSerProLeu 
61 ACTGGACGCGGGGCGAACGTTGCGATCTGGAAGACAGGGACAGGTCCGAGCTCAGCCCGT 
TGACCTGCGCCCCGCTTGCAACGCTAGACCTTCTGTCCCTGTCCAGGCTCGAGTCGGGCA 

, T . LeuLeuTh rTh rThrG 1 nTrpG 1 nVa 1 LeuProCysSerPheTh rThrLeuProA 1 aLeu 
121 TACTGCTGACCACTACACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTACCAGCCT 
ATGACGACTGGTGATGTGTCACCGTCCAGGAGGGCACAAGGAAGTGTTGGGATGGTCGGA 

SerThrGlyLeuIleHisLeuHisGlnAsnlleValAspValGlnTyrLeuTyrGlyVal 
181 TGTCCACCGGCCTCATCCACCTCCACCAGAACATTGTGGACGTGCAGTACTTGTACGGGG 
ACAGGTGGCCGGAGTAGGTGGAGGTGGTCTTGTAACACCTGCACGTCATGAACATGCCCC 

„, , T ^GlySerSerIleAlaSerTrpAlaIleLysTrpGluTyrValValLeuLeuPheLeuLeu 
241 TGGGGTCAAGCATCGCGTCCTGGGCCATTAAGTGGGAGTACGTCGTTCTCCTGTTCCTTC 
ACCCCAGTTCGTAGCGCAGGAGCCGGTAATTCACCCTCATGCAGCAAGAGGACAAGGAAG 

, LeuAlaAspAlaArgVa 1 CysSe r CysLeuTrpMetMetLeuLeu 1 1 eSe rG 1 nA 1 aG 1 u 
301 TGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGTGGATGATGCTACTCATATCCCAAGCGG 
ACGAACGTCTGCGCGCGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCC 

_AlaAlaLeuGluAsnLeuValIleLeuAsnAlaAlaSerLeuAlaGlyThrHisGlyLeu 
361 AGGCGGCTTTGGAGAACCTCGTAATACTTAATGCAGCATGCCTGGCCGGGACGCACGGTC 
TCCGCCGAAACCTCTTGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAG 

Va 1 SerPheLeuVa 1 PhePheCysPheA 1 aTrpTyrLeuLysG lyLysTrpVa 1 ProG 1 y 
421 TTGTATCCTTCCTCGTGTTCTTCTGCTTTGCATGGTATTTGAAGGGTAAGTGGGTGCCCG 
AACATAGGAAGGAGCACAAGAAGACGAAACGTACCATAAACTTCCCATTCACCCACGGGC 

, n , „ AlaValTyrThrPheTyrGlyMetTrpProLeuLeuLeuLeuLeuLeuAlaLeuProGln 
481 GAGCGGTCTACACCTTCTACGGGATGTGGCCTCTCCTCCTGCTCCTGTTGGCGTTGCCCC 
CTCGCCAGATGTGGAAGATGCCCTACACCGGAGAGGAGGACGAGGACAACCGCAACGGGG 

_ f , _ArgAlaTyrAlaLeuAspThrGluValAlaAlaSerCysGlyGlyValValLeuValGly 
541 AGCGGGCGTACGCGCTGGACACGGAGGTGGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCG 
TCGCCCGCATGCGCGACCTGTGCCTCCACCGGCGCAGCACACCGCCACAACAAGAGCAGC 

^^Uy | ! 1 S tAlaLeuThrLeuSerProT y rT y rL ysArgTyrIleSerTrpCysLeuTrpTrp 
601 GGTTGATGGCGCTGACTCTGTCACCATATTACAAGCGCTATATCAGCTGGTGCTTGTGGT 
CCAACTACCGCGACTGAGACAGTGGTATAATGTTCGCGATATAGTCGACCACGAACACCA 

^^LeuGlnTyrPheLeuThrArgValGluAlaGlnLeuHisValTrpIleProProLeuAsn 
661 GGCTTCAGTATTTTCTGACCAGAGTGGAAGCGCAACTGCACGTGTGGATTCCCCCCCTCA 
CCGAAGTCATAAAAGACTGGTCTCACCTTCGCGTTGACGTGCACACCTAAGGGGGGGAGT 

™, . ^y?jl ArgG 1 yG 1 yA rgAspA 1 aVa 1 1 leLeuLeuMetCysA 1 aVa 1 H i sProTh rLeuVa 1 
721 ACGTCCGAGGGGGGCGCGACGCCGTCATCTTACTCATGTGTGCTGTACACCCGACTCTGG 
TGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATGAGTACACACGACATGTGGGCTGAGACC 

™, ^ Pj^AspneThrLysLeuLeuLeuAlaValPheGlyProLeuTrpIleLeuGlnAlaSer 
781 TATTTGACATCACCAAATTGCTGCTGGCCGTCTTCGGACCCCTTTGGATTCTTCAAGCCA 
ATAAACTGTAGTGGTTTAACGACGACCGGCAGAAGCCTGGGGAAACCTAAGAAGTTCGGT 

ol , LeuL^LvsValProTyrPheValArgValGlnGlyLeuLeuArgPheCysAlaLeuAla 
841 GTTTGCTTAAAGTACCCTACTTTGTGCGCGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAG 
CAAACGAATTTCATGGGATGAAACACGCGCAGGTTCCGGAAGAGGCCAAGACGCGCAATC 



) 

0 
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A rgLysMe 1 1 1 eG 1 yG 1 yH i sTy rVa 1 G 1 nMe tVa 1 1 1 e 1 1 eLysLeuG 1 yA 1 aLeuTh r 
901 CGCGGAAGATGATCGGAGGCCATTACGTGCAAATGGTCATCATTAAGTTAGGGGCGCTTA 
GCGCCTTCTACTAGCCTCCGGTAATGCACGTTTACCAGTAGTAATTCAATCCCCGCGAAT 

G lyThrTy rVa ITyrAsnH i SLeuThrProLeuArgAspTrpA laH i sAsnG lyLeuArg 
961 CTGGCACCTATGTTTATAACCATCTCACTCCTCTTCG5GACTGGGCGCACAACGGCTTGC 
GACCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTGACCCGCGTGTTGCCGAACG 

AspLeuA 1 a Va 1 A 1 aVa 1 G 1 uProVa 1 Va 1 PheSe rG 1 nMetG 1 uThrLy s Leu 1 1 eTh r 
1021 GAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCCCAAATGGAGACCAAGCTCATCA 
CTCTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTTTACCTCTGGTTCGAGTAGT 

TrpGlyAlaAspThrAlaAlaCysGlyAspIlelleAsnGlyLeuProValSerAlaArg 
1081 CGTGGGGGGCAGATACCGCCGCGTGCGGTGACATCATCAACGGCTTGCCTGTTTCCGCCC 
GCACCCCCCGTCTATGGCGGCGCACGCCACTGTAGTAGTTGCCGAACGGACAAAGGCGGG 

ArgGlyArgGluIleLeuLeuGlyProAlaAspGlyMetValSerLysGlyTrpArgLeu 

1141 GCAGGGGCCGGGAGATACTGCTCGGGCCAGCCGATGGAATGGTCTCCAAGGGGTGGAGGT 
CGTCCCCGGCCCTCTATGACGAGCCCGGTCGGCTACCTTACCAGAGGTTCCCCACCTCCA 

LeuAlaProI leThrAlaTyrAlaGlnGlnThrArgGlyLeuLeuGlyCysI lei leThr 
1201 TGCTGGCGCCCATCACGGCGTACGCCCAGCAGACAAGGGGCCTCCTAGGGTGCATAATCA 
ACGACCGCGGGTAGTGCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCCCACGTATTAGT 

Se r LeuTh rG 1 yArgAspLysAsnG 1 nVa 1 G 1 uG 1 yG 1 uVa 1 G 1 n 1 1 eVa 1 SerThr A 1 a 
1261 CCAGCCTAACTGGCCGGGACAAAAACCAAGTGGAGGGTGAGGTCCAGATTGTGTCAACTG 
GGTCGGATTGACCGGCCCTGTTTTTGGTTCACCTCCCACTCCAGGTCTAACACAGTTGAC 

AlaGlnThrPheLeuAlaThrCysIleAsnGlyValCysTrpThrValTyrHisGlyAla 
1321 CTGCCCAAACCTTCCTGGCAACGTGCATCAATGGGGTGTGCTGGACTGTCTACCACGGGG 
GACGGGTTTGGAAGGACCGTTGCACGTAGTTACCCCACACGACCTGACAGATGGTGCCCC 

G 1 yThrA rgThrl 1 eA 1 aSe rProLysG 1 yProVa 1 1 1 eG 1 hMetTy rThr AsnVa 1 Asp 
1381 CCGGAACGAGGACCATCGCGTCACCCAAGGGTCCTGTCATCCAGATGTATACCAATGTAG 
GGCCTTGCTCCTGGTAGCGCAGTGGGTTCCCAGGACAGTAGGTCTACATATGGTTACATC 

GlnAspLeuValGlyTrpProAlaProGlnGlySerArgSerLeuThrProCyrThrCys 
mm ACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAGGTAGCCGCTCATTGACACCCTGCACTT 
TGGTTCTGGAACACCCGACCGGGCGAGGCGTTCCATCGGCGAGTAACTGTGGGACGTGAA 

G 1 ySe r Se r AspLeuTy r LeuVa IThrArgHisAl a AspVa 1 1 1 eP rova 1 A rgA rgA rg 
1501 GCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACGCCGATGTCATTCCCGTGCGCCGGC 
CGCCGAGGAGCCTGGAAATGGACCAGTGCTCCGTGCGGCTACAGTAAGGGCACGCGGCCG 

GlyAspSerArgGlySerLeuLeuSerProArgProIleSerTyrLeuLysGlySerSer 
1561 GGGGTGATAGCAGGGGCAGCCTGCTGTCGCCCCGGCCCATTTCCTACTTGAAAGGCTCCT 
CCCCACTATCGTCCCCGTCGGACGACAGCTGGGCCGGGTAAAGGATGAACTTTCCGAGGA 

G 1 yG 1 yProLeuLeuCy sProA laGlyHlsAlaValGlyl 1 ePheA rgA 1 aA 1 aVa 1 Cys 
1621 CGGGGGGTCCGCTGTTGTGCCCCGCGGGGCACGCCGT6GGCATATTTAGGGCCGCGGTGT 
GCCCCCCAGGCGACAACACGGGGCGCCCCGTGCGGCACCCGTATAAATCCCGGCGCCACA 

ThrA rgG 1 y Va 1 A 1 aLysA 1 aVa 1 AspPhe 1 1 eProVal G 1 uAsnLeuG 1 uTh rTh rMet 
1681 GCACCCGTGGAGTGGCTAAGGCGGTGGACTTTATCCCTGTGGAGAACCTAGAGACAACCA 
CGTGGGCACCTCACCGATTCCGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGT 
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ArgSerProVa 1 PheTh rAspAsnSe rSerP roProVa 1 Va 1 P roG 1 nSe rPheG 1 nVa 1 
17^11 TGAGGTCCCCGGTGTTCACGGATAACTCCTCTCCACCAGTAGTGCCCCAGAGCTTCCAGG 
ACTCCAGGGGCCACAAGTGCCTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCC 

AlaHisLeuHisAlaProThrGlySerGlyLysSerThrLysValProAlaAlaTyrAla 
1801 TGGCTCACCTCCATGCTCCCACAGGCAGCGGCAAAAGCACCAAGGTCCCGGCTGCATATG 
ACCGAGTGGAGGTACGAGGGTGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTATAC 

, , „ . A|aG 1 nG 1 yTy rLys Va 1 LeuVa 1 LeuAsnProSe rVa 1 A 1 aA 1 aTh r LeuG 1 yPheG 1 y 
1861 CAGCTCAGGGCTATAAGGTGCTAGTACTCAACCCCTCTGTTGCTGCAACACTGGGCTTTG 
GTCGAGTCCCGATATTCCACGATCATGAGTTGGGGAGACAACGACGTTGTGACCCGAAAC 

, M , A 1 aTy rMetSe rLysA laHisGlyl 1 eAspproAsn 1 1 eA rgTh rG 1 y Va 1 A rgThrl le 
1921 GTGCTTACATGTCCAAGGCTCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAA 
CACGAATGTACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTT 

ThrThrGlySerProIleThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCys 
1981 TTACCACTGGCAGCCCCATCACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGT 
AATGGTGACCGTCGGGGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCA 

- ftJ , SerGlyGlyAlaTyrAspnen 

2041 GCTCGGGGGGCGCTTATGACATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACAT 
CGAGCCCCCCGCGAATACTGTATTATTAAACACTGGTCACGGTGAGGTGCCTACGGTGTA 

^ U§Uy§ 1 y II eG 1 yTh rVa 1 LeuAspG 1 n A 1 aG 1 uTh rA 1 aG 1 yA 1 a A rgLeuVa 1 Va 1 
2101 CCATCTTGGGCATCGGCACTGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTG 
GGTAGAACCCGTAGCCGTGACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAAC 

TO LeuAlaThrAlaThrProProGlySerValThrValProHisProAsnIleG^ 
2161 TGCTCGCCACCGCCACCCCTCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGG 
ACGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCC 

nrtn , A 1 aLeuSe rTh rThrG 1 yG 1 u 1 1 eProPheTy rG 1 yLy s A 1 a 1 1 eProLeuG 1 uVa 1 1 1 e 
2221 TTGCTCTGTCCACCACCGGAGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAA 
AACGAGACAGGTGGTGGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATT 

oo„, T ^L y ^I^MC9Ui sLeuIlePheC y sHIsSerL ysLysLysCysAspGluLeuAlaAla 
2281 TCAAGGGGGGGAGACATCTCATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCG 
AGTTCCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGC 

LysLeuValAlaLeuGlylleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVal 
2341 CAAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCG 
GTTTCGACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGC 

oin , I leProThrSerG lyAspVa lVa 1 Va 1 Va 1 A laThrAspA laLeuMetTh rG 1 yTy rThr 
2401 TCATCCCGACCAGCGGCGATGTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATA 
AGTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATAT 

GlyAspPheAspSerVallleAspCysAsnThrCysValThrGlnThrValAspPheSer 
2461 CCGGCGACTTCGACTCGGTGATAGACTGCAATACGTGTGTCACCCAGACAGTCGATTTCA 
GGCCGCTGAAGCTGAGCCACTATCTGACGTTATGCACACAGTGGGTCTGTCAGCTAAAGT 

^LeuAspProThrPheThrlleG^ 
2521 GCCTTGACCCTACCTTCACCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCA 
CGGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGT 

„ Cfl , G 1 nA rgArgG 1 yA rgTh rG 1 yA rgG 1 yLysProG ly I leTyrA rgPheVa 1 A 1 aP roG 1 y 
2581 CTCAACGTCGGGGCAGGACT 

GAGTTGCAGCCCCGTCCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCC 

„, _ ^GluArgProSerGlyMe^ 

2641 GGGAGCGCGCCTCCGGCATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCT 
CCCTCGCGGGGAGGCCGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGA 

1 aTrpTy rG 1 uLeuTh rProA laG 1 uTh rThrVa 1 A rgLeuArgA 1 aTy rMe tAsnTh r 
2701 GTGCTTGGTATGAGCTCACGCCCG^ 

CACGAACCATACTCGAGTGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGT 



FIG. 32D 

ProG 1 yLeuProVa 1 CysG 1 n AspHl sLeuG 1 uPheTrpG 1 uG 1 y Va 1 PheTh rG 1 y Leu 
2761 CCCCGGGGCTTCCCGTGTGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCC 
GGGGCCCCGAAGGGCACACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGG 

ThrHislleAspAlaHisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyr 
2821 TCACTCATATAGATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTGCTT 
AGTGAGTATATCTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAA 

LeuValAlaTyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAsp 
2881 ACCTGGTAGCGTACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGG 
TGGACCATCGCATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCC 

GlnMetTrpLysCysLeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeu 
2941 ACCAGATGTGGAAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGC 
TGGTCTACACCTTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACG 

TyrArgLeuGlyAlaValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlle 
3001 TATACAGACTGGGCGCTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACA 
ATATGTCTGACCCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGT 

MetThrCysMetSerAlaAspLeuGluValValThrSerThrTrpValLeuValGlyGly 
3061 TCATGACATGCATGTCGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCG 
AGTACTGTACGTACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGC 

Va 1 Leu A 1 a A 1 aLeu A 1 a A 1 aTy r Cy sLeuSe rTh rG 1 y Cys Va 1 Va 1 1 1 eVa 1 G 1 y A rg 
3121 GCGTCCTGGCTGCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCA 
CGCAGGACCGACGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGT 

Va 1 Va 1 LeuSe rG 1 y LysProA 1 a 1 1 e 1 1 eProAspA rgG 1 uVa 1 LeuTy r A rgG 1 uPhe 
3181 GGGTCGTCTTGTCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGT 
CCCAGCAGAACAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCA 

AspG 1 uMe tG 1 uG 1 uCysSe rG 1 nH i sLeuP roTy r 1 1 eG 1 uG 1 nG 1 yMe tMe tLeuA 1 a 
3241 TCGATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCG 
AGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGC 

GluGlnPheLysGlnLysAlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVal 
3301 CCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGG 
GGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCC 

I leAl aProA 1 aVa 1 G 1 nTh rAsnTrpG 1 nLysLeuG 1 uThrPheTrpA 1 aLysH i sMet 
3361 TTATCGCCCCTGCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATA 
AATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTAT 

TrpAsnPhelleSerGlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnPro 
3421 TGTGGAACTTCATCAGTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACC 
ACACCTTGAAGTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGG 

A 1 a 1 1 eA 1 aSe rLeuMet A 1 aPheTh rA 1 aA 1 aVa ITh rSerProLeuTh rTh rSerG 1 n 
3481 CCGCCATTGCTTCATTGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCC 
GGCGGTAACGAAGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGG 

ThrLeuLeuPheAsnlleLeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAla 
3541 AAACCCTCCTCTTCAACATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTG 
TTTGGGAGGAGAAGTTGTATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCAC 

A 1 aTh r A 1 aPheVa 1G 1 yA 1 aG 1 yLeuA 1 aG 1 yA 1 a A 1 a 1 1 eG 1 ySe rVa 1G 1 yLeuG 1 y 
3601 CCGCTACTGCCTTTGTGGGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGG 
GGCGATGACGGAAACACCCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACC 

LysValLeuI leAspI leLeuAlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAla 
3661 GGAAGGTCCTCATAGACATCCTTGCAGGGTATGGGGCGGGCGTGGCGGGAGCTCTTGTGG 
CCTTCCAGGAGTATCTGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACC 

PheLysIleMetSerGlyGluValProSerThrGluAspLeuValAsnLeuLeuProAla 
3721 CATTCAAGATCATGAGC€GTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCG 
GTAAGTTCTAGTACTCGCCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGC 



FIG. 32E 

„„ 1 1 eLeySe rPrgG 1 yA 1 aLeuVa 1 Va 1 6 1 y Va 1 Va 1 Cys A 1 a A 1 a 1 1 eLeuA rg A rgH 1 s 
3781 CCATCCTCTC6CCC66A6CCCTC6TA6TCGGC6TG6TCTGT6CA6CAATACTGCGCCG6C 
GGTAGGAGAGCGGGCCTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCG 

70M . ValGlyProGlyG 1 uG 1 vA 1 aVa 1 G 1 nTrpMe t Asn A rgLeuI 1 e A 1 aPheA 1 aSe r A rg 
3841 ACGTTGGCCCGGGCGAGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCC 
TGCAACCGGGCCCGCTCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGG 

3901 GGGGg'aACCATGTTTC^ 

CCeCCTTGGTACAAAGGGGGTGCGTGATGCACGGCCTCTCGCTACGTCGACGGGCGCAGT 

AlalleLeuSerSerLeuThrValThrGlnLeuLeuArgArgLeuHisGlnTrpIleSer 
3961 CTGCCATACTCAGCAGCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAA 
GACGGTATGAGTCGTCGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATT 

, „„, „ SerGluCysThrThrProCysSerGlySerTrpLeuArgAspI leTrpAspTrpI leCys 
4021 GCTCGGAGTGTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATAT 
CGAGCCTCACATGGTGAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCCTGACCTATA 

,n a , ^ GluValLeuSerAspPI^LysThrTrpLeuLysAlaLysLeuMet 
*»081 GCGAGGTGTTGAGCGACTTTAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTG 
CGCTCCACAACTCGCTGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGAC 

, , , , I leProPheValSerCysGlnArgGlyTyrLysGlyValTrpArgValAspGlyl leMet 
4141 GGATCCCGTTTGTGTCCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGTGGACGGCATCA 
CCTAGGGGAAACACAGGACGGTCGCGCCCATATTCCCCCAGACCGCTCACCTGCCGTAGT 

,o«, ^HisThrArgCysHisCysGlyAlaGluIleThrGlyHisValLysAsnGlyThrMetArg 
4201 TGCACACTCGCTGCCACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGACGATGA 
ACGTGTGAGCGACGGTGACACCTCGACTCTA6TGACCTGTACAGTTTTTGCCCTGCTACT 

.o^-, IleVaj^MroArgT^ 

4261 GGATCGTCGGTCCTAGGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCT 
CCTAGCAGCCAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGA 

. v« , . /J&TDdi 1 yProCysThrProLeuProA 1 aProAsnTy rThrPheA 1 aLeuTrpA rgVa 1 
4321 ACACCACGGGCCCCTGTACCCCCCTTCCTGCGCCGAACTACACGTTCGCGCTATGGAGGG 
TGTGGTGCCCGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCC 

SerAlaGluGluTyrValGluIleArgGlnValGlyAspPh 
4381 TGTCTGCAGAGGAATATGTGGAGATAAGGCAGGTGGGGGACTTCCACTACGTGACGGGTA 
ACAGACGTCTCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCAT 

, , , , ThrThrAspAsnLeuLysCysProCysGlnValProSerProGluPhePheThrGluLeu 
4441 TGACTACTGACAATCTCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTCACAGAAT 
ACTGATGACTGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTA 

. ^£?B G 1 yVa 1 ArgLeuH i sArgPheA 1 aProProCysLysProLeuLeuA rgG 1 uG 1 uVa 1 
4501 TGGACGGGGTGCGCCTACATAGGTTTGCGCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGG 
ACCTGCCCCACGCGGATGTATCCAAACGCGGGGGGACGTTCGGGAACGACGCCCTCCTCC 

, cc , T A C?0?^ f 1 5 A YV=§y 1 §5 i '"^ roV3 1 G 1 ySs rG 1 n LcuP roCy s G 1 u P roG 1 u- 
4561 TATCATTCAGAGTAG6ACTCCACGAATACCCGGTAGGGTCGCAATTACCTTGCGAGCCCG 
ATAGTAAGTCTCATCCTGAGGTGCTTATGGGCCATCCCAGCGTTAATGGAACGCTCGGGC 

, , . . ProAspVa 1 A 1 aVa 1 LeuThrSerMetLeuThrAspProSerH i s 1 1 eTh rA 1 aG 1 uA 1 a 
4621 AACCGGACGTGGCCGTGTTGACGTCCATGCTCACTGATCCCTCCCATATAACAGCAGAGG 
TTG6CCTGCACCGGCACAACTGCAGGTACGAGTGACTAGGGAGGGTATATTGTCGTCTCC 

. A laG lyArgArgLeuA laArgGlySerProProSerVa 1 A laSerSerSerA laSerG 1 n 
4681 CGGCCGG6CGAAGGTTGGCGAGGGGATCACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCC 
GCCGGCCCGCTTCCAACCGCTCCCCTAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGG 

, , , , . LeuSerA 1 aProSerLeuLys A 1 aTh rCvsTh rA 1 aAsnH i sAspSe rPro AspA 1 aG 1 u 
4741 AGCTATCCGCTCCATCTCTCAAGGCAACTTGCACCGCTAACCATGACTCCCCTGATGCT 
TCGATAGGCGAGGTAGAGAGTTCCGTTGAACGTGGCGATTGGTACTGAGGGGACTACGAC 
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.^uIlfGluAlaAsnLeuLeuTrpArgGlnGluMetGlyGlyAsnlleThr-ArgValGlu 
4801 AGCTCATA6A6GCCAACCTCCTATGGA66CAGGAGATGGGCGGCAACATCACCAGGGTTG 
TCGAGTATCTCCGGTTGGAGGATACCTCCGTCCTCTACCCGCCGTTGTAGTGGTCCCAAC 

, . SerGluAsnLysValVallleLeuAspSerPheAspProLeuValAlaGluGluAspGlu 
4861 AGTCAGAAAACAAAGTGGTGATTCTGGACTCCTTCGATCCGCTTGTGGCGGAGGAGGACG 
TCAGTCTTTTGTTTCACCACTAAGACCTGAGGAAGCTAGGCGAACACCGCCTCCTCCTGC 

, 001 .-ArgGluIleSerVal^^ 

4921 AGCGGGAGATCTCCGTACCCGCAGAAATCCTGCGGAAGTCTCGGAGATTCGCCCAGGCCC 
TCGCCCTCTAGAGGCATGGGCGTCTTTAGGACGCCTTCAGAGCCTCTAAGCGGGTCCGGG 

, T „ProyalIrpAlaArgProAspTyrAsnProProLeuValGluThrTrpLysLysProAsp 
4981 TGCCCGTJTGGGCGCGGCCGGACTATAACCCCCCGCTAGTGGAGACGTGGAAAAAGCCCG 
ACGGGCAAACCCGCGCCGGCCTGATATTGGGGGGCGATCACCTCTGCACCTTTTTCGGGC 

. T y rG 1 uProProVal Va 1 H i sG 1 yCysProLeuProProProLysSerProProVa 1 Pro 
5041 ACTACGAACCACCTGTGGTCCATGGCTGTCCGCTTCCACCTCCAAAGTCCCCTCCTGTGC 
TGATGCTTGGTGGACACCAGGTACCGACAGGCGAAGGTGGAGGTTTCAGGGGAGGACACG 

^ProProArgLysLysArgThrValValLeuThrGluSerThrLeuSerThrAlaLeuAla 
5101 CTCCGCCTCGGAAGAAGCGGACGGTGGTCCTCACTGAATCAACCCTATCTACTGCCTTGG 
GAGGCGGAGCCTTCTTCGCCTGCCACCAGGAGTGACTTAGTTGGGATAGATGACGGAACC 

, GlyLeuAlaThrArgSerPheGlySerSerSerThrSerGlylleThrGlyAspAsnThr 
5161 CCGAGCTCGCCACCAGAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGACAATA 
GGCTCGAGCGGTGGTCTTCGAAACCGTCGAGGAGTTGAAGGCCGTAATGCCCGCTGTTAT 

roo-. rrIPrTP^r§^ly^9^jl§^2 s ? rG Jly c y sPr oProAspSerAspAlaGluSerTyr 
5221 CGACAACATCCTCTGAGCCCGCCCCTTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCT 
GCTGTTGTAGGAGACTCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGA 

, SerSerMetProProLeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTrp 
5281 ATTCCTCCATGCCCCCCCTGGAG^ 

TAAGGAGGTACGGGGGGGACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTA 

Se rTh rVa 1 Se rSe rG 1 u A 1 a Asn A 1 aG 1 uAspVa 1 Va 1 CysCy sSe rMe tSe rTy r Se r 
5341 GGTCAACGGTCAGTAGTGAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTCTTACT 
CCAGTTGCCAGTCATCACTCCGGTTGCGCCTCCTACAGCACACGACGAGTTACAGAATGA 

TrpJhrGlyAlaLeuValThrProCysAlaAlaGluGluGlnLysLeuProIleAsnAla 
5401 CTTGGACAGGCGCACTCGTCACCCCGTGCGCCGCGGAAGAACAGAAACTGCCCATCAATG 
GAACCTGTCCGCGTGAGCAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTAC 

r. LeuSerAsnSerLeuLeuArgHisHisAsnLeuV 
5461 CACTAAGCAACTCGTTGCTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTG 
GTGATTCGTTGAGCAACGATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCAC 

m , sG 1 nA r gG 1 nLys Lys Va 1 Th r PheAspA rgLeuG 1 nVa 1 Leu AspSe rH i sTy rG 1 n 
5521 CTTGCCAAAGGCAGAAGAAAGTCACATTTGACAGACTGCAAGTTCTGGACAGCCATTACC 
GAACGGTTTCCGTCTTCTTTCAGTGTAAACTGTCTGACGTTCAAGACCTGTCGGTAATGG 

cco , .^AspyalLeuLysGluyalLysA^ 

5581 AGGACGTACTCAAGGAGGTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTGCTATCCG 
TCCTGCATGAGTTCGTCCAATTTGGTCGCCGCAGTTTTCACTTCCGATTGAACGATAGGC 

_ . . , T G 1 uG 1 yAl aCysSerLeuJhrProProH i sSe rA 1 aLysSe r Ly s PheG 1 yjy rG 1 yA 1 a 
5641 TAGAGGAAGCTTGCAGCCTGACGCCCCCACACTCAGCCAAATCCAAGTTTGGTTATGGGG 
ATCTCCTTCGAACGTCGGACTGCGGGGGTGTGAGTCGGTTTAGGTTCAAACCAATACCCC 

LysAspValArgCysHisAlaArgLysAlaValThrHisIleAsnSerValTrpLysAsp 
5701 CAAAAGACGTCCGTTGCCATGCCAGAAAGGCCGTAACCCACATCAACTCCGTGTGGAAAG 
GTTTTCTGCAGGCAACGGTACGGTCTTTCCGGCATTGGGTGTAGTTGAGGCACACCTTTC 

, . LeuLeuG 1 uAspAsnVa 1 ThrPro 1 1 eAspTh rTh r I leMetA laLysAsnG 1 uVa 1 Phe 
5761 ACCTTCTGGAAGACAATGTAACACCAATAGACACTACCATCATGGCTAAGAACGAGGTTT 
TGGAAGACCTTCTGTTACATTGTGGTTATCTGTGATGGTAGTACCGATTCTTGCTCGAAA 



FIG. 32G 

CysvalG 1 nProG 1 uLVsG 1 yG 1 yArgLysProA 1 aA rgLeu 1 1 eVa 1 PheP roAspLeu 
5821 TCTGC6TTCAGCCT6AGAAGGGGGGTCGTAAGCCAGCTCGTCTCATCGTGTTCCCCGATC 
AGACGCAAGTCGGACTCTTCCCCCCAGCATTCGGTCGAGCAGAGTAGCACAAGGGGCTAG 

GlyValArgValCysGluLysMetAlaLeuTyrAspValValThrLysLeuProLeuAla 
5881 TGGGCGTGCGCGTGTGCGAAAAGATGGCTTTGTACGACGTGGTTACAAAGCTCCCCTTGG 
ACCCGCACGCGCACACGCTTTTCTACCGAAACATGCTGCACCAATGTTTCGAGGGGAACC 

Va 1 Me tG 1 ySe rSe rTy rG 1 y PheG 1 nTy rSe rProG 1 yG 1 nArgVa 1 G 1 uPheLeuVa 1 
5941 CCGTGATGGGAAGCTCCTACGGATTCCAATACTCACCAGGACAGCGGGTTGAATTCCTCG 
GGCACTACCCTTCGAGGATGCCTAAGGTTATGAGTGGTCCTGTCGCCCAACTTAAGGAGC 

GlnAlaTrpLysSerLysLysThrProMetGlyPheSerTyrAspThrArgCysPheAsp 
6001 TGCAAGCGTGGAAGTCCAAGAAAACCCCAATGGGGTTCTCGTATGATACCCGCTGCTTTG 
ACGTTCGCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTATGGGCGACGAAAC 

SerThrValThrGluSerAspIleArgThrGluGluAlalleTyrGlnCysCysAspLeu 
6061 ACTCCACAGTCACTGAGAGCGACATCCGTACGGAGGAGGCAATCTACCAATGTTGTGACC 
TGAGGTGTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGTTAGATGGTTACAACACTGG 

AspProGlnAlaArgValAlalleLysSerLueThrGluArgLeuTyrValGlyGlyPro 
6121 TCGACCCCCAAGCCCGCGTGGCCATCAAGTCCGTCACCGAGAGGCTTTATGTTGGGGGCC 
AGCTGGGGGTTCGGGCGCACCGGTAGTTCAGGGAGTGGCTCTCCGAAATACAACCCCCGG 

LeuThrAsnSerArgGlyGluAsnCysGlyTyrArgArgCysArgAlaSerGlyValLeu 

6181 CTCTTACCAATTCAAGGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGCGAGCGGCGTAC 
GAGAATGGTTAAGTTCCCCCeTCTTGACGCCGATAGCGTCCACGGCGCGCTCGCCGCATG 

ThrThrSerCysGlyAsnThrLeuThrCysTyrlleLysAlaArgAlaAlaCysArgAla 
6241 TGACAACTAGCTGTGGTAACACCCTCACTTGCTACATCAAGGCCCGGGCAGCCTGTCGAG 
ACTGTTGATCGACACCATTGTGGGAGTGAACGATGTAGTTCCGGGCCCGTCGGACAGCTC 

A 1 aG lyLeuG 1 nAspCysTh rMetLeuva 1 CysG 1 yAspAspLeuVa 1 Va 1 1 1 eCysG 1 u 
6301 CCGCAGGGCTCCAGGACTGCACCATGCTCGTGTGTGGCGACGACTTAGTCGTTATCTGTG 
GGCGTCCCGAGGTCCTGACGTGGTACGAGCACACACCGCTGCTGAATCAGCAATAGACAC 

Ser A 1 aG 1 y Va 1 G 1 nG 1 u AspA 1 a A 1 aSe rLeuArgA 1 aPheTh rGluAl aMetTh rA rg 
6361 AAAGCGCGGGGGTCCAGGAGGACGCGGCGAGCCTGAGAGCCTTCACGGAGGCTATGACCA 
TTTCGCGCCCCCA6GTCCTCCTGCGCCGCTCGGACTCTCGGAAGTGCCTCCGATACTGGT 

TyrSerAlaProProGlyAspProProGlnProGluTyrAspLeuGluLeuIleThrSer 
6421 GGTACTCCGCCCCCCCTGGGGACCCCCCACAACCAGAATACGACTTGGAGCTCATAACAT 
CCATGAGGCGGGGGGGACCCCTGGGGGGTGTTGGTCTTATGCTGAACCTCGAGTATTGTA 

CysSerSerAsnValSerValAlaHisAspGlyAlaGlyLysArgValTyrTyrLeuThr 
6481 CATGCTCCTCCAACGTGTCAGTCGCCCACGACGGCGCTGGAAAGAGGGTCTACTACCTCA 
GTACGAGGAGGTTGCACAGTCAGCGGGTGCTGCCGCGACCTTTCTCCCAGATGATGGAGT 

Arg AspProTh rTh rProLeuA 1 aA rgA 1 aA 1 aTrpG 1 uTh rA 1 aA r gH I sTh rP roVa 1 
6541 CCCGTGACCCTACAACCCCCCTCGCGAGAGCTGCGTGGGAGACAGCAAGACACACTCCAG 
GGGCACTGGGATGTTGGGGGGAGCGCTCTCGACGCACCCTCTGTCGTTCTGTGTGAGGTC 

AsnSerTrpLeuGlyAsnllelleMetPheAlaProThrLeuTrpAlaArgMetlleLeu 
6601 TCAATTCCTGGCTAGGCAACATAATCATGTTTGCCCCCACACTGTGGGCGAGGATGATAC 
AGTTAAGGACCGATCCGTTGTATTAGTACAAACGGGGGTGTGACACCCGCTCCTACTATG 

MetThrHisPhePheSerValLeuIleAlaArgAspGlnLeuGluGlnAlaLeuAspCys 
6661 TGAT6ACCCATTTCTTTAGCGTCCTTATAGCCAGGGACCAGCTTGAACAGGCCCTCGATT 
ACTACTGGGTAAAGAAATCGCAGGAATATCGGTCCCTGGTCGAACTTGTCCGGGAGCTAA 

Glul leTyrGlyAlaCysTyrSerl leGluProLeuAspLeuProProI lei leGlnArg 
6721 GCGAGATCTACGGGGCCTGCTACTCCATAGAACCACTTGATCTACCTCCAATCATTCAAA 
CGCTCTAGATGCCCCGGACGATGAGGTATCTTGGTGAACTAGATGGAGGTTAGTAAGTTT 

Leu 

6781 GACTC 
CTGAG 
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Sample 

Chimp date (days) ALT (alanine) 

Lane Reference Infection (O=inoculation aminotransferase 



Number 


Number 


Type 


day) 


level in 


sera 


1 


1 


NANB 


0 
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2 


1 


NANB 


76 




71 


3 


1 


NANB 


118 




19 


4 


1 


NANB 


154 




M / A 

N/A 


5 


2 


NANB 


0 




0 


6 


2 


NANB 


21 




52 


7 


2 


NANB 


73 




13 


8 


2 


NANB 


138 




hi / A 

N/A 


9 


3 


NANB 


0 




8 


10 


3 


NANB 


43 




205 


11 


3 


NANB 


53 




14 


12 


3 


NANB 
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13 


4 


NANB 


-3 




11 


14 


4 


NANB 


55 




132 


15 


4 


NANB 


83 




N/A 


lb 


4 


M AMU 


14U 




M / A 
IN / A 


17 


5 


HAV 


0 
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18 


5 


HAV 


25 




147 


19 


5 


HAV 


40 




18 


on 


-> 


HAV 






c 


21 


6 


HAV 


-8 




N/A 


22 


6 


HAV 


15 




100 


£.5 


b 


U A\/ 

HAV 


Zil 






24 


6 


HAV 


129 




N/A 


26 


7 


HAV 


0 




7 


27 


7 


HAV 


22 




83 


oo 


/ 


U AW 

HAV 








29 


7 


HAV 


139 




N/A 


30 


8 


HAV 


0 




15 


31 


8 


HAV 


26 




130 


32 


8 


HAV 


74 




8 


33 


8 


HAV 


205 




5 


34 


9 


HBV 


-290 




N/A 


35 


9 


HBV 


379 




9 


36 


9 


HBV 


435 




6 


37 


10 


HBV 


0 




8 


38 


10 


HBV 


111- 


•118 (pool) 


96- 


39 


10 


HBV 


205 




9 


40 


10 


HBV 


240 




13 


41 


11 


HBV 


0 


■56 (pool) 


11 


42 


11 


HBV 


28- 


8 


43 


11 


HBV 


169 




9 


44 


11 


HBV 


223 




10 



(pool) 



■100 (pool) 
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Patient 






Lane 


Reference 






XT. V •» 

Number 






A-uX XiCVGX V, l 


' 1 


h 




1354 


Z 




MA MR 

it All O 


31 




2l 


MAMR 


14 

X *T 


A 
** 


2l 


NANR 


79 


c 

J 


2l 


NANR 


26 


6 




NANB 


78 


7 


il 


NANB 


87 


Q 
O 




NANR 


25 


rv 

9 




MAMTJ 
riAli-D 


60 


10 




NANB 


13 


11 




NANB 


298 


1Z 




MA MR 
IN All Jj 


101 


J. J 


6l 


NANR 


474 


14 


fil 


MA MR 
IN AINU 


^1 ft 


15 


?I 


MA MR 


90 


16 




MAMR 


X 


1/ 


ftl 


MAMR 


A A 
H *t 


1 Q 


ll 


MAMR 


50 


1 Q 

19 


Q 

9 


MAMR 


N/A 

IT / A 




1U 


MAMR 


N /A 


21 


11 


MAMR 


N/A 

Li / A. 


ZZ 


1Z 


Li v» JL. lllct J. 


N/A 


Zj 




N/*IT""mfl 1 
llUl Uiu. J. 


N/A 








N/A 


26 


30174, 


Normal 


N/A 


27 


30105 


Normal 


N/A 


28 


30072 


Normal 


N/A 


29 


30026 


Normal 


N/A 


30 


30146 


Normal 


N/A 


31 


30250 


Normal 


N/A 


32 


30071 


Normal 


XT / A 

N/A 


33 


15 


AcuteHAV 


XT / A 

N/A 


34 


16 


AcuteHAV 


XT / A 

N/A 


35 


17 


AcuueflAV 


N/A 

ll / A 


36 


18 


AcuteHAV 


N/A 


37 


48088 


AcuteHAV 


N/A 


38 


47288 


AcuteHAV 


N/A 


39 


47050 


AcuteHAV 


N/A 


40 


46997 


AcuteHAV 


N/A 


41 


19 


Convalescent HBV 


N/A 


42 


20 


( anti-HBSag+ve ; 


N/A 


43 


21 


anti-HBCag+ve) 


N/A 


44 


22 


(anti-HBSag+ve; 


N/A 


45 


23 


anti-HBCag+ve ) 


N/A 


46 


24 


( anti-HBSag+^e ; 


N/A 


47 


25 


anti-HBCag+ve) 


N/A 


48 


26 


( anti-HBSag+ve ; 


N/A 


49 


27 


anti-HBSag+ve) 


N/A 



^ Sequential 



serum samples were assayed from these patients 
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--------SOD CO0H1 [—adaptor ] [NANBHpolypeptide> 

, A 1 aCysG 1 y Va 1 1 leGlyl leAlaGl nAsnLeuG lyl 1 eA rg AspA 1 aH i sPheLeuSe r 
1 GCTTGTGGTGTAATTGGGATCGCCCAGAATTTGG 

CGAACACCACATTAACCCTAGCGGGTCTTAAACCCTTAAGCCCTACGGGTGAAAGATAGG 

»>»»»»>>>»>»>> 
^ , GlpTh r Ly sG 1 nSe rG 1 yG 1 u Asn LeuProTy rLeuVa 1 A 1 aTy rG 1 n A 1 aTh rVa 1 Cy s 
61 QAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTGC 

GTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTTCGGTGGCACACG 

AlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCysLeuileArgLeu 
121 GCTAG66gCMGCCCCTCCCCCATC6T6GGACCAGAT6TGGAAGTGmGATTCGCCTC 
CGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACAAACTAAGCGGAG 

i on ^^P^oJhrLeuHisG JLyProThrProLeuLeuTyrA rgLeuG 1 yA 1 aVa 1 G 1 n AsnG 1 u 
181 AAGCCCACCCTCCATGGGa 

TTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGACAAGTCTTAGTT 

n , , UejhrLeuThrH i sProVa IThrLysTy r I leMetThrCysMetSerA 1 aAspLeuG 1 u 
241 ATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCGGCCGACCTGGAG 
TAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGCCGGCTGGACCT-C 

, ni X? IVa 1 Th rSerTh rTrpVa 1 LeuVa 1 G 1 yG 1 yVa 1 Leu A 1 a A 1 aLeuA 1 aA 1 aTy rCy s 
301 gCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTGC 
CAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAACCGGCGCATAACG 

LeuSerThrGlyCysyalVallleValGlyArgValValLe^^^ 

TATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGTG 

. n , LeuProTy r I leGluGl nG lyMetMe tLeuA 1 aG 1 uG 1 nPheLysG 1 nLy s A 1 aLeuG 1 y 
481 TTACCCTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGGC 
AATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCG 

LeuLeuGlnThrAlaSerA^ 

601 

GTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATG 

LeuAlaGlyLeuSerThrLeuProGl^ 
661 TTGGCGGGCTTGTCAACGCTGCCTG 

AACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGT 

w AlaAlaValTlirSerProLeuThrThrS^ 

721 GgGgCTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCmAACATATTGGGGGGG 
CGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATAACCCCCCC 

_ Q , TrpValAlaAlaG 1 nLeuA 1 a A 1 aProG 1 yA 1 a A 1 aTh r A 1 aPhe Va lGlyAlaGlyLeu 
781 TGG6TGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTTA 
ACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAAT 



FIG. 36B 

nt , Ala61yAlaAlaIleGlySerValGlyLeuGlyLysValLeuIleAspIleLeuAlaGly 

ATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCACTCCAGGGG 

„, SerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGlyAlaLeuVa 
961 TCCACGGAGGACCTGGTCAATCTACTG 

AGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGGAGCATCAG 

, no , GlyVa 1 Va 1 CysA la A la 1 1 eLeuA rgA rgH 1 sVa 1 G 1 y ProG 1 yG 1 uG 1 yA 1 aVa 1 G 1 n 
1021 GGCGTGGjqGTGCAGCAATACTGCGC 

CCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCCGTCACGTC 

T m *. . , ? i a i ■>> <«<<««««««<«NANBH]f — extra 
, nn , TrpMe tAsn ArgLeu I leAl aPheA 1 aSe rA rgG 1 yAsnH 1 sVa 1 Se rProVa lHlsHis 
1081 TGGATGAACC66CTGATAGCCnC6CCTCCCG66GGAACCAT6TnCCCCA6TCCATCAT 

ACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGTCAGGTAGTA 

] 

LysArgOP 

mi AAGCGTTGACGCTCCCTACGGGTGGACTGTGGAGAGACAGGGCACTGCTAAGGCCCAAAT 
TTCGCAACTGCGAGGGATGCCCACCTGACACCTCTCTGTCCCGTGACGATTCCGGGTTTA 

1201 CTCAGCCATGCATCGAGGG^^ 

GAGTCGGTACGTAGCTCCCCATGTTAGGCATACCGGTTGTTGATCGCGCATGCATTTCAG 

1261 TCCrTTCTCGATGGTCCATACCTTAGATGCGTTAGCATTAATCCGAATTC 
AGGAAAGAGCTACCAGGTATGGAATCTACGCAATCGTAATTAGGCTTAAG 



FIG. 37A 



FIG. 37B 

1 2 




FIG. 40 




A 
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FIG. 42A 

10 20 30 40 50 

HCV EYVVLLFLLLADARVCSCLWMMLLISQAEAALENLVILNAASLAGTHGLVSFLVFFCFA 

MNWVD1 AVSFVTLITGNMSFRDLGRVMVMVGATMTDDIGMGVTYLALLAAFKVRPTFAAGLLLRKL 
130 140 150 160 170 180 

60 70 80 90 100 110 

HCV WYLKGKWVPGAVYTFYGMWPLLLLLLALPQRAYALDTEVAASCGGVVLVGLMALTLSPYY 

MNWVD1 TSKELMMTTIGIVLLSQSTIPETILELTDALALGMMVLKMVRKMEKYQLAVTIMAILCVP 
190 200 210 220 230 240 

120 130 140 150 160 170 

HCV KRYISWCLWWLQYFLTRVEAQLHVWIPPLNVRGGRDAVILLMCAVHPTLVFDITKLLLAV 



MNWVD1 NAVILQNAWKVSCTiLAVVSVSPLFLtsSQQKADW^ 

250 260 270 280 290 

180 190 200 210 220 230 

HCV FGPLWILQASLLKVPYF-VRVQGLLRF-CALARKMIGGHYVQMyi IKLGALTGTYVYNHL 

MNWVD1 KKRSWPLNEAiMAVGMVSiLASSLtKNDIPMtGPLVA 

300 310 320 330 340 350 

240 250 260 270 280 290 

HCV TPLRDWAHNGLRDLAVAVEPWFSQMETKLITWGADTAACGDI INGLPVSARRGREILLG 

MNWVD1 ADVK-WEMAEISGSSPiLS^™ 

360 370 380 390 400 410 

300 310 320 330 340 350 

HCV PADGMVSKGWRLLAPITAYAQQTRGLLGCIITSLTGRDKNQVEGEVQIVSTAAQTFLATC 

MNWVD1 VS IPltAAAWYLWEVKKQRAGVLWDVPSPPPVGKAELEDGAYR I KQKGi LGYSQ I GAGVY 
420 430 440 450 460 470 

360 370 380 390 400 410 

HCV I NGVCWTVYHGAGTRTI ASPKGPV I QMYTNVDQDLV GWPAPQGSRSLTPCTCGSSD 

MNWVD1 KEGtFHtMWHVtRGAVLMHKGKRiEPSWADVKK6LvSCGG 

480 490 500 510 520 530 

420 430 440 450 460 470 

HCV LYLVTRHADVIPVRRRGDSRGSLLSPRPISYLKGSSGGPLLCPAGHAVGIFRAAVCTRGV 

MNWVD1 PGKNPRAVQtKPGLFKTN — AGTiGAVSLDFSP^t^SPiiDKK^KVV^LYGNGVVtRSG 
540 550 560 570 580 590 



• 
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180 190 500 510 520 ^0 

HCV AKAVDFIPVENLETTMRSPVFTDNSSPPVVPQSFQVAHLHAPTGSGKS — TKVPAAYAAQ 

MNWVD1 AYVSA i AQTEK--S I EDNJ^EI EflDI FRK — RKLTiMDiAPGAIjKtkRYLPA i VRGA I KR 
600 610 620 630 640 

HCV GYKVLVLNPS--VAATLGFGAYMSKAHGIDPNIRTGVRTITTGSPITYSTYGKF 

* i • ■ ■ • ■ lit 

MNWVD1 GLRTULAPtRVVAAEMEEALRGLPI^ 

650 660 670 680 690 700 

590 600 610 620 630 640 

HCV SGGAYDI 1 1 CDECHSTDATS I LGI GTVLDQAETAGARLVVLATATPPGSVTVPHPN I EEV 

MNWVD1 RVPNYNL i i MDEAHFi#A$ i A ARGY i stRVE-MGEAAG i FMtAtpM§RD-PFI^QSN AP 
710 720 730 740 750 760 

650 660 670 680 690 700 

HCV ALSTTGE I PFYGKA I PLEV I KGGRHL I FCHSKKKCDELAAKLVALG I NAVA YYRGLDVSV 

MNWVD1 IMDEEREi PERSWSSGHEWVTDFKGKTVWFVPS I KAGNDTAACLRKNGKKVTQLSRKTFD 
770 780 790 800 810 820 

710 720 730 740 750 760 

HCV I PTSGDVVVVATDA LMTGYTGDFDSV I DCNTCVTQTVDFSLDPTFT I ET I TLPQDA VSRT 

MNWVD1 SEYVKTRTNDWNFVVTTDISEMGANFKAERVIDPRRCMKPVILTDGEERVILAGPMPVTH 
830 840 850 860 870 880 

770 780 790 800 810 820 

HCV QRRGRTGRGKPGIYRFVAPGERPSGMFDSSVLCECYDAGCAWYELTPAETTVRLRAYMNT 

MNWVD1 SS 
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Name Common Sequence 

5 ' - 3 - 1 AAGCTTGATCGAATTC 
-2 
-3 
-4 
-5 
-6 

-7 

-8 

-9 
-10 
-11 
-12 

-13 
-14 
-15 
-16 
-17 
-18 

-19 
-20 
-21 
-22 
-23 
-24 

-25 
-26 
-27 
-28 
-29 
-30 

-31 
-32 
-33 
-34 
-35 
-36 



Variable Sequence 

CGATCTTGC 
CGATCCTGC 
CGATCATGC 
CGATCGTGC 
CGAAGTTGC 
CGAAGCTGC 

AGATCTTGC 
AGATCCTGC 
AGATCATGC 
AGATCGTGC 
AGAAGTTGC 
AGAAGCTGC 

CGATCTTGT 
CGATCCTGT 
CGATCATGT 
CGATCGTGT 
CGAAGTTGT 
CGAAGCTGT 

AGATCTTGT 
AGATCCTGT 
AGATCATGT 
AGATCGTGT 
AGAAGTTGT 
AGAAGCTGT 

CGCTCTTGC 
CGCTCCTGC 
CGCTCATGC 
CGCTCGTGC 
CGCAGTTGC 
CGCAGCTGC 

CGCTCTTGT 
CGCTCCTGT 
CGCTCATGT 
CGCTCGTGT 
CGCAGTTGT 
CGCAGCTGT 
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FIG. 47 A 

, 6 lyCysProG luArgLeuA 1 aSe rCysArgProLeuThr AspPheAspG 1 nG 1 yTrpG 1 y 
1 CAGGCTGTCCTGAGAGGCTAGCCAGCTGCCGACCCCTTACCGATTTTGACCAGGG^ 
GTCCGACAGGACTCTCCGATCGGTCGACGGCTGGGGAATGGCTAAAACTGGTCCCGACCC 

r-, ^ProIleSerTyrAlaAsnGlySerGlyProAspGlnArgProTyrCysTrpHisTyrPro' 
61 GCCCTATCAGTTATGCCAACGGAAGCGGCCCCGACCAGCGC^ 

CGGGATAGTCAATACGGTTGCCTTCGCCGGGGCTGGTCGCGGGGATGACGACCGTGATGG 

^ ProLysProCysGlylleValProAlaLysSerValCysGlyProValTyrCysPheThr 
121 CCCCAAAACCTTGCGGTATTGTGCCCGCGAAG 

GGGGTTTTGGAACGCCATAACACGGGCGCTTCTCACACACACCAGGCCATATAACGAAGT 

PrgSerProValValValGlyThrThrAspArgSerGlyAlaProThrTyrSerTrpGly 
181 CTCCCAGCCCCGTGGTGGTGGGAACGACCGACAGGTCGG6CGCGCCCACCTACAGCTGGG 
GAGGGTCGGGGCACCACCACCCTTGCTGGCTGTCCAGCCCGCGCGGGTGGATGTCGACCC 

0 . , J? 1 uAsnAspThr AspVa 1 PheVa 1 LeuAsnAsnThr A rgProProLeuG 1 yAsnTrpPhe 
241 GTGAAAATGATACGGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGT 

CACTTTTACTATGCCTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCGACCCGTTAACCA 

301 TC^lWM?^ 

AGCCAACATGGACCTACTTGAGTTGACCTAAGTGGTTTCACACGCCTCGCGGAGGAACAC 

7n ^HeGlyGlyAlaGlyAsnAsnThrLeuHisCysProThrAspCysPheArgLysHisPro 
361 TCATCGGAGGGGCGGGCAACAA 

AGTAGCCTCCCCGCCCGTTGTTGTGGGACGTGACGGGGTGACTAACGAAGGCGTTCGTAG 

■ 01 AspAlaThrTyrSerArg^ leThrProArgCysLeuValAsp 
121 CGGACGCCACATACTCTCGGTGCGGCTCCGGTCCCTGGATCACACCCAGGTGCCTGGTCG 
GCCTGCGGTGTATGAGAGCCACGCCGAGGCCAGGGACCTAGTGTGGGTCCACGGACCAGC 

,. 0 1 a Tyr ProTy r ArgLeuTr pH i sTy rProCysTh r 1 1 e AsnTy rThrl 1 ePheLys 1 1 eA rg 
181 ACTACCCGTATAGGCTTT^ 

TGATGGGCATATCCGAAACCGTAATAGGAACATGGTAGTTGATGTGGTATAAATTTTAGT 

Cl . „ MetTyrValGlyGlyValGluH^ 

511 GGATGTACGTGGGAGGGGTCGAACACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCG 
CCTACATGCACCCTCCCCAGCTTGTGTCCGACCTTCGACGGACGTTGACCTGCGCCCCGC 

cni , ArgCysAspLeuGluAspArgAspArgSerGluLeuSerProLeuLeuLeuTbrThrTh 
601 AACGTTGCGATCTGGAAGACAGGGACAGGTCCGAGCTCAGCCCGTTACTGCTGACCACTA 
TTGCAACGCTAGACCTTCTGTCCCTGTCCAGGCTCGAGTCGGGCAATGACGACTGGTGAT 

r- r , ^ GlnTrpGlnValLeuProCysSerPheThrThrLeuProAlaLeuSerThrGlyLeuIle 
661 CACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTACCAGCCTTGTCCACCGGCCTCA 
GTGTCACCGTCCAGGAGGGCACAAGGAAGTGTTGGGATGGTCGGAACAGGTGGCCGGAGT 

Hi sLeuH i sG 1 n Asn 1 1 eVa 1 AspVa 1 G 1 nTy r LeuTy rG 1 y Va 1 G 1 ySe rSe r 1 1 eA 1 a 
721 TCCACCTCCACCAGAACATTGTG 

AGGTGGAGGTGGTCTTGTAACACCTGCACGTCATGAACATGCCCCACCCCAGTTCGTAGC 

-, 0 •, RAla 1 1 eLysTrpG 1 uTy rVa 1 Va 1 LeuLeuPheLeuLeuLeuA 1 aAspA laA rg 

781 CGTCCTGGGCCATTAAGTGGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGC 
GCAGGACCCGGTAATTCACCCTCATGGAGCAAGAGGACAAGGAAGACGAACGTCTGCGCG 

01. i rry?^X5^^X?t^Ii:R |V|e J ,V|etLeuLeuI leSerGI nA'laG 1 uA laA 1 aLeuG 1 uAsn 

TGGAGCATTATGAATTACGTCGTAGGGACGGGCCCTGCGTGCCAGAACATAGGAAGGAGC 



1 



FIG. 47B 

PhePheCysPheA laTrpTy rLeuLysG 1 yLy sTrpVa 1 ProG 1 yA 1 aVa lTy r I n rPhe 
961 TGTTCTTCTGCTTTGCATGGTATTTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCT 
ACAAGAAGACGAAACGTACCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGA , , 

TyrGlyMetTrpProLeuLeuLeuLeuLeuLeuAlaLeuProGlnArgAlaTyrAlaLeu 
102 1 TCTACGGGAT6TG6CCTCTCCTCCTGCTCCT6TTGGCGTTGCCCC AGCGGGCGTACGCGC ' 
AGATGCCCTACACCGGAGAGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCG 

AspTh rG 1 uVa 1 A 1 a A 1 aSe rCysG 1 yG 1 y Va 1 Va 1 LeuVa 1 G 1 yLeuMet A 1 aLeuTh r 
1081 TGGACACGGAGGTGGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGA 
ACCTGTGCCTCCACCGGCGCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGACT 

LeuSerProTyrTyrLysArgTyrlleSerTrpCysLeuTrpTrpLeuGlnTyrPheLeu 
1141 CTCTGTCACCATATTACAAGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTC 
GAGACAGTGGTATAATGTTCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAG 

Thr A rgVa 1G 1 uA 1 aG 1 nLeuH i s Va 1 TrpI 1 eProProLeuAsnVa 1 A rgG 1 yG 1 yA rg 
1201 TGACCAGAGTGGAAGCGCAACTGCACGTGTGGATTCCCCCCCTC^ 

ACTGGTCTCACCTTCGCGTTGACGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCG 

AspAlaVallleLeuLeuMetCysAlaValHisProThrLeuValPheAspIleThrLys 
1261 GCGACGCCGTCATCTTACTCATGTGTGCTGTACACCCGACTCTGGTATTTGACATCACCA 
CGCTGCGGCAGTAGAATGAGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGT 

LeuLeuLeuA 1 aVa 1 PheG 1 yProLeuTrpI 1 eLeuG 1 nA 1 aSe rLeuLeuLysVa 1 Pro 
1321 AATTGCTGCTGGCCGTCTTCGGACCCCTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTAC 
TTAACGACGACCGGCAGAAGCCTGGGGAAACCTAAGAAGTTCGGTCAAACGAATTTCATG 

TyrPheValArgValGlnGlyLeuLeuArgPheCysAlaLeuAlaArgLysMetlleGly 
1381 CCTACTTTGTGCGCGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCG 
GGATGAAACACGCGCAGGTTCCGGAAGAGGCCAAGACGCGCAATCGCGCCTTCTACTAGC 

G 1 y H i sTy rVa 1 G 1/iMe tVa 1 1 1 e 1 1 e Ly s Le uG 1 y A 1 aLeuTh rG 1 yTh rTy r Va 1 Ty r 
mm GAGGCCATTACGTGCAAATGGTCATCATTAAGTTAGGGGCGCTTACTGGCACCTATGTTT 
CTCCGGTAATGCACGTTTACCAGTAGTAATTCAATCCCCGCGAATGACCGTGGATACAAA 

AsnH i sLeuTh rProLeuArgAspTrpA 1 aH i sAsnG lyLeuA rgAspLeuA 1 aVa 1 A 1 a 
1501 ATAACCATCTCACTCCTCTTCGGGACTGGGCGCACAACGGCTTGCGAGATCTGGCCGTGG 
TATTGGTAGAGTGAGGAGAAGCCCTGACCCGCGTGTTGCCGAACGCTCTAGACCGGCACC 

Va 1 G 1 uP r oVa 1 Va 1 PheSe rG 1 nMe tG 1 uTh r Ly s Le u 1 1 eTh rTr pG 1 yA 1 a AspTh r 
1561 CTGTAGAGCCAGTCGTCTTCTCCCAAATGGAGACCAAGCTCATCACGTGGGGGGCAGATA 
GACATCTCGGTCAGCAGAAGAGGGTTTACCTCTGGTTCGAGTAGTGCACCCCCCGTCTAT 

AlaAlaCysGlyAspIlelleAsnGlyLeuProValSerAlaArgArgGlyArgGluIle 
1621 CCGCCGCGTGCGGTGACATCATCAACGGCTTGCCTGTTTCCGCCCGCAGGGGCCGGGAGA 
GGCGGCGCACGCCACTGTAGTAGTTGCCGAACGGACAAAGGCGGGCGTCCCCGGCCCTCT 

LeuLeuGlyProAlaAspGlyMetValSerLysGlyTrpArgLeuLeuAlaProIleThr 
1681 TACTGCTCGGGCCAGCCGATGGAATGGTCTCCAAGGGGTGGAGGTTGCTGGCGCCCATCA 
ATGACGAGCCCGGTCGGCTACCTTACCAGAGGTTCCCCACCTCCAACGACCGCGGGTAGT 

AlaTyrAlaGlnGlnThrArgGlyLeuLeuGlyCysI lei leTtlrSerLeuThrGlyArg 
1741 CGGCGTACGCCCAGCAGACAAGGGGCCTCCTAGGGTGCATAATCACCAGCCTAACTGGCC 
GCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCCCACGTATTAGTGGTCGGATTGACCGG 

AspLysAsnGlnValGluGlyGluValGlnlleValSerThrAlaAlaGlnThrPheLeu 
1801 GGGACAAAAACCAAGTGGAGGGTGAGGTCCAGATTGTGTCAACTGCTGCCCAAACCTTCC 
CCCTGTTTTTGGTTCACCTCCCACTCCAGGTCTAACACAGTTGACGACGGGTTTGGAAGG 

A 1 aThrCysl 1 eAsnG 1 yVa 1 CysTrpThrVa lTyrHi sG 1 yA 1 aG 1 yThrA rgTh r 1 1 e 
1861 TGGCAACGTGCATCAATGGGGTGTGCTGGACTGTCTACCACGGGGCCGGAAW 

ACCGTTGCACGTAGTTACCCCACACGACCTGACAGATGGTGCCCCGGCCTTGCTCCTGGT 

AlaSerProLysGlyProVallleGlnMetTyrThrAsnyalAspGlnAsp^ 
1921 TCGCGTCACCCAAGGGTCCTGTCATCCAGATGTATACCAATGTAGACCAAGACCTTGTGG 
AGCGCAGTGGGTTCCCAGGACAGTAGGTCTACATATGGTTACATCTGGTTCTGGAACACC 



) 
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, TrpProAlaProGlnGlySerArgSerLeuThrProCysThrCysGlySerSerAspLeu 
1981 GCTGGCCCGCTCCGCAAGGTAGCCGCTCATTGACACCCTGCACTTGCGGCTCC'TCGGACC: 
CGACCGGGCGAGGCGTTCCATCGGCGAGTAACTGTGGGACGTGAACGCCGAGGAGCCTGG 

on , , TyrLeuValThrArgHisAlaAspVallleProValArgArgArgGlyAspSerArgGly- 
2041 TTTACCTGGTGACGAGGCACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGG 
AAATGGACCAGTGCTCCGTGCGGCTACAGTAAGGGCACGCGGCCGCCCCACTATCGTCCC 

o-,n, o^SerLeuLeuSerProArgProIleSerTyrLeuLysGlySerSerGlyGlyProLeuLeu 
2101 GCAGCCTGCTGTCGCCCCGGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGT 
CGTCGGACGACAGCGGGGCCGGGTAAAGGATGAACTTTCCGAGGAGCCCCCCAGGCGACA 

CysProAlaGlyHisAlaValGlyllePheArgAlaAlaValCysThrArgGlyValAla 
2161 TGTGCCCCGCGGGGCACGCCGTGGGCATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGG 
ACACGGGGCGCCCCGTGCGGCACCCGTATAAATCCCGGCGCCACACGTGGGCACCTCACC 

oooi r T U^jl?y?^fRK?U^2y a Jl GluAsnLeuGluThrTh rMetArgSerProValPhe 
2221 CTAAGGCGGTGGACTTTATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGT 
GATTCCGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACA 

ooo -. -rrTP^??^?P§?t§?f ProProVa 1 Va 1 P roG 1 nSe rPheG 1 nVa 1 A 1 aH i sLeuH i s A 1 a 
2281 TCACGGATAACTCCTCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGG 

AGTGCCTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTAC 

o, , , ,. T ££2TD rG JX§§ rG ly LysSe rTh rLysVa 1 P roA 1 aA 1 aTy rA 1 a A 1 aG 1 nG 1 yTy rLys 
2341 CTCCCACAGGCAGCGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATA 
GAGGGTGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATAT 

o.n-. , V|lL?yya]Uy A §Q ProSerValAlaAlaThrLeu GlyPheGlyAlaTy 
2401 AGGTGCTAGTACTCAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCA 
TCCACGATCATGAGTTGGGGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGT 

o,. n ^ A laHisGlyIleAspProAsnIleArgThrGlyValArgThrIleThrThrGlySerPro 
2461 AGGCTCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCC 
TCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGG 

ocol ^^HeThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyr 
2521 CCATCACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTT 
GGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAA 

oroi * A spneneIleCysAspGluCysHisSerTh-rAspAlaThrSerIleLeuGlyIleGly 
2581 ATGACATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATCG 
TACTGTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAGC 

or , , ^IPtX 3 - InAlaGI uTh rA 1 aG 1 yA 1 aA rgLeuVa 1 Va 1 LeuA 1 aTh rA 1 aTh r 

2641 gcactgtccttgaccaagcagagactgcgggggggagIctggttgtgctcgccaccgcca 
cgtgacaggaactggttcgtctctgacgcccccgctctgaccaacacgagcggtggcggt 

ProProGlySerValThrValProHisProAsnlleGluGluValAlaLeuSerThrThr 
2701 CCCCTCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCA 
GGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGT 

, 7C1 „ GlyGluIleProP^TyrGly 

2761 CCGGAGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGAC 
GGCCTCTCTAGG6AAAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTG 

ooo-, . T j^uHePheCysHisSerLysLysLysCysAspGluLeuAlaAlaLysLeuValAlaLeu 
2821 ATCTCATCTTCTGTCATTCAAAGAA6AAGTGCGACGAACTCGCCGCAAAGCTGGTCGCAT 
TAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTA 

o OQ1 GlyUeAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVallleProThrSe 
2881 TGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCG 
ACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGC 

CGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGA 



FIG. 47D 

Val I leAspCysAsnThrCvsValThrGlnThrVal AspPheSerLeuAsDProThrPhe 
3001 CGGTGATAGACTGCAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCT 
GCCACTATCTGACGTTATGCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGA 

Thrl leGluThrl leThrLeuProGlnAspAlaValSerArgThrGlnArgArgGlyArg . 
3061 TCACCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCA 
AGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGT 

ThrG lyArgG 1 yLysProG ly 1 1 eTyrA rgPheVa 1 A 1 aProG LyG 1 uArgProSerG 1 y 
3121 GGACTGGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCG 
CCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGC 

MetPheAspSerSerVa 1 LeuCysG 1 uCysTy r AspA 1 aGlyCysA laTrpTy rG luLeu 
3181 GCATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGC 
CGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCG 

ThrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThrProGlyLeuProVal 
3241 TCACGCCCGCCGAGACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCG 
AGTGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGC 

CysGlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeuThrHisIleAspAla 
3301 TGTGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATG 
ACACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTAC 

HisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrLeuValAlaTyrGln 
3361 CCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACC 
GGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGG 

AlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCys 
3421 AAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGT 
TTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCA 

LeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeuTyrArgLeuGlyAla 
3481 GTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCG 
CAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGC 

ValGlnAsnGluIleThrLeuThrHisProValThrLysT-yrlleMetThrCysMetSer 
3541 CTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGT . 
GACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACA 

A 1 a AspLeuG 1 uVa 1 Va 1 ThrSe rTh rTrpVa 1 LeuVa 1 G 1 yG 1 yVa 1 LeuA 1 aA 1 aLeu 
3601 CGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTT 
GCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAA 

A 1 a A 1 aTy rCysLeuSe rTh rG 1 y Cy sVa 1 Va 1 1 1 eVa 1 G 1 yA rgVa 1 Va 1 LeuSe rG 1 y 
3661 TGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCG 
ACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGC 

LysProAlallelleProAspArgGluValLeuTyrArgGluPheAspGluMetGluGlu 
3721 GGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAG 
CCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTC 

CysSerGlnHlsLeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPheLysGln 
3781 AGTGCTCTCA6CACTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGC 
TCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCG 

LysAlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaProAlaVal 
3841 AGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTG 
TCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGAC 

GlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhelleSer 
3901 TCCAGACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCA 
AGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGT 

G ly 1 1 eG 1 nTy r LeuA 1 aG 1 y LeuSe rThrLeuProG 1 yAsnP roA 1 a 1 1 e A 1 aSe rLeu 
3961 GTGGGATACAATACTTGGGGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCAT 
CACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTA 



FIG. 47E 

,«o, T ^etAlaPheThrAlaAlaValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsn 
4021 TGATGGCTTTTACJ\^ 

ACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGT 

/,no 1 *A IfLeuGlyGlyTrpVa 1 A 1 aA 1 aG 1 nLeuA 1 a A 1 aP roG 1 yA 1 a A 1 aTh r A 1 aPheVa 1 
4081 ACATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTG 

TGTATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAAC 

6I^ laG1 y LeuAlaG lyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAsD 
4141 TGGGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTG^ 

ACCCGCGACCGAATCGACGGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATC 

, ™, . ^ III uA 1 aG 1 yTy rG 1 yA 1 aG 1 y Va 1 A 1 aG 1 yA laLeuVa 1 A 1 aPheLys 1 1 eMetSer 
4201 ACATCCTTGCAGGGTATGGCGCGGG 

TGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACT 

. GlyGluValProSerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGly 
4261 GCGGTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCT 

CGCCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGC 

AlaLeuValValGlyValValCysAlaAlal leLeuArgArgHisValGlyProGlyGlu 
4321 GAGCCCTCGTAGTCGGCGTGGTCTGTGCAGCAAT^ 

CTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGC 

4381 AGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGG 

TCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAA 

/, ,. ,. 1 ^EKTDEH } sTy rVa 1 P roG 1 use r Asp A 1 aA 1 a A 1 a A rgVa 1 Th rA 1 a 1 1 eLeuSe rSe r 
4441 CCCCCACGCACTACGTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCA 
GGGGGTGCGTGATGCACGGCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGT 

(lCm LeuThryalThrGln 

4501 GCCTCACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCA 
CGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGT 

,.r C -, o-r^SX^tCSI^erTrP'-euArgAspIleTrpAspTrpIleCysGluValLeuSerAsp 
4561 CTCCATGCTCCGGTK 

GAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGC 

A PheLysTh^rpLeuLysAlaLysLeuMetProGln 
4621 ACTTTAAGACCTGGCTAAAAGCTAA 

TGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACA 

1 co , CysGlnArgGlyTyrLysGlyValTrpArgValAspGlyl leMetHisThrArgCysHis 
4681 CCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCGCTGCC 
GGACGGTCGCGCCCATATTCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCGACGG 

TGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTACTCCTAGCAGCCAGGAT 

i.oni r rIPr£X2fcg A fPtfSJj£R§S£^ 

*»801 GGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCAn 

CCTGGACGTCCTTGTACACCTCAGCCTGGAAGGGGTAATTACGGATGTGGTGCCCGGGGA 

/.oc 1 r T I&E£l&§y££§£ 1 aProAsnTy rThrPheA 1 aLeuTrpArgVa 1 SerA 1 aG 1 uG 1 uTy r 
^ 861 S^P^TI^T^^CGAACTA^ 

CATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTA 

z » 921 S G IGG A G A TAAGGCAGGTGGG^ 

TACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAG 

. T AysCysPrgCy sG 1 nVa 1 ProSerProG 1 uPhePheTh rG 1 uLeuAspG 1 y Va 1 A rgLeu 
4981 TCAMTGCCCGTGCGAGGTCCC^ 

AGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGCGG 
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HisArgPheAlaProProCysLysProLeuLeuArgGluGluValSerPheArgValGly 
5011 TACATAGGTTTGCGCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAG 
ATGTATCCAAACGCGGGGGGACGTTCGGGAACGACGCCCTCCTCGATAGTAAGTCTCATC 

LeuH i sG 1 uTy rProVa 1 G lySerG 1 nLeuProCysG 1 uProG 1 uProAspVa 1 Al aVa 1 
5101 GACTCCACGAATACCCGGTAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCG 
CTGAGGTGCTTATGGGCCATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCTGCACCGGC 

LeuTh rSe rMetLeuThr AspProSerH i s 1 1 eTh rA 1 aG 1 uA 1 aA 1 aG 1 yArgA rgLeu 
5161 TGTTGACGTCCATGCTCACTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGT 
ACAACTGCAGGTACGAGTGACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCA 

AlaArgGlySerProProSerValAlaSerSerSerAlaSerGlnLeuSerAlaProSer 
5221 TGGCGAGGGGATCACCCCCGTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCAT 
ACCGCTCCCCTAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTA 

LeuLysAlaThrCysThrAlaAsnHisAspSerProAspAlaGluLeuIleGluAlaAsn 
5281 CTCTCAAGGCAACTTGCACCGCTAACCATGACTCCCCTGATGCTGAGCTCATAGAGGCCA 
GAGAGTTCCGTTGAACGTGGCGATTGGTACTGAGGGGACTACGACTCGAGTATCTCCGGT 

LeuLeuTrpArgGlnGluMetGlyGlyAsnlleThrArgValGluSerGluAsnLysVal 
5311 ACCTCCTATGGAGGCAGGAGATGGGCGGCAACATCACCAGGGTTGAGTCAGAAAACAAAG 
TGGAGGATACCTCCGTCCTCTACCCGCCGTTGTAGTGGTCCCAACTCAGTCTTTTGTTTC 

Va 1 1 1 eLeuAspSe rPhe AspProLeuVa 1 A 1 aG 1 uG 1 uAspG 1 uA rgG 1 u I leSe rVa 1 
5401 TGGTGATTCTGGACTCCTTCGATCCGCTTGTGGCGGAGGAGGACGAGCGGGAGATCTCCG 
ACCACTAAGACCTGAGGAAGCTAGGCGAACACCGCCTCCTCCTGCTCGCCCTCTAGAGGC 

_ _ ProAlaGluI leLeuArgLysSerArgArgPheAlaGlnAlaLeuProValTrpAlaArg 
5461 TACCCGCAGAAATCCTGCGGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGC 
ATGGGCGTCTTTAGGACGCCTTCAGAGCCTCTAAGCGGGTCCGGGACGGGCAAACCCGCG 

ProAspTyrAsnProProLeuValGluThrTrpLysLysProAspTyrGluProProVal 
5521 GGCCGGACTATAACCCCCCGCTAGTGGAGACGTGGAAAAAGCCCGACTACGAACCACCTG 
CCGGCCTGATATTGGGGGGCGATCACCTCTGCACCTTTTTCGGGCTGATGCTTGGTGGAC 

ValHisGlyCysProLeuProProProLysSerProProValProProProArgLysLys 
5581 TGGTCCATGGCTGTCCGCTTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGCCTCGGAAGA 
ACCAGGTACCGACAGGCGAAGGTGGAGGTTTCAGGGGAGGACACGGAGGCGGAGCCTTCT 

„ , , . „ ArgTh rVa 1 Va 1 LeuThrG 1 uSerTh rLeuSe rTh rA 1 aLeuA 1 aG 1 uLeuA 1 aTh rA rg 
5641 AGCGGACGGTGGTCCTCACTGAATCAACCCTATCTACTGCCTTGGCCGAGCTCGCCACCA 
TCGCCTGCCACCAGGAGTGACTTAGTTGGGATAGATGACGGAACCGGCTCGAGCGGTGGT 

SerPheGlySerSerSerThrSerGlylleThrGlyAspAsnThrThrThrSerSerGlu 
5701 GAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGACAATACGACAACATCCTCTG 
CTTCGAAACCGTCGAGGAGTTGAAGGCCGTAAT6CCCGCTGTTATGCTGTTGTAGGAGAC 

ProAlaProSerGlyCysProProAspSerAspAlaGluSerTyrSerSerMetProPro 
5761 AGCCCGCCCCTTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCC 
TCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGG 

■ ^^LeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTrpSerThrValSerSer 

5821 CCCTGGAGGGGGAGCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACGGTCAGTA 
GGGACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGCCAGTCAT 

_„„, GluAlaAsnAlaGluAspValValCysCysSerMetSerTyrSerTrpThrGlyAlaLeu 
5881 GTGAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTCTTACTCTTGGACAGGCGCAC 
CACTCCGGTTGCGCCTCCTACAGCACACGACGAGTTACAGAATGAGAACCTGTCCGCGTG 

ValThrProCysAlaAlaGluGluGlnLysLeuProIleAsnAlaLeuSerAsnSerLeu 
5941 TCGTCACCCCGTGCGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGT 
AGCAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTACGTGATTCGTTGAGCA 

LeuArgHisHIsAsnLeuValTyrSerThrThrSerArgSerAlaCysGlnArgGlnLys 
6001 TGCTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAGA 
ACGATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCT 
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6061 AGAXA6TCACAmGA?A6?cf 6CM 1 U 

TCTTTCAGTGTAAACTGTCTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATGAGTTCC 

_ „ ValLysAlaAlaAlaSerLysValLysAlaAsnLeuLeuSerValGluGluAlaCysSer 
6121 AGGTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGGAAGCTTGCA 
TCCAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAACGATAGGCATCTCCTTCGAACGT 

LeuThrProProHisSerAlaLysSerLysPheGlyTyrGlyAlaLysAspValArgCys 
6181 GCCTGACGCCCCCACACTCAGCCAAATCCAAGTTTGGTTATGGGGCAAAAGACGTCCGTT 
CGGACTGCGGGGGTGTGAGTCGGTTTAGGTTCAAACCAATACCCCGTTTTCTGCAGGCAA 

, HlsAlaArgLysAlaValThrHisIleAsnSerValTrpLysAspLeuLeuGluAspAsn 

6241 GCCATGCCAGAAAGGCCGTAACCCACATCAACTCCGTGTGGAAAGACCTTCTGGAAGACA 
CGGTACGGTCTTTCCGGCATTGGGTGTAGTTGAGGCACACCTTTCTGGAAGACCTTCTGT 

Va 1 Th rPro 1 1 eAspTh rThrl 1 eMe tA 1 aLy sAsnG 1 uVa 1 PheCys Va 1 G 1 nP roG 1 u 
6301 ATGTAACACCAATAGACACTACCATCATGGCTAAGAACGAGGTTTTCTGCGTTCAGCCTG 
TACATTGTGGTTATCTGTGATGGTAGTACCGATTCTTGCTCCAAAAGACGCAAGTCGGAC 

LysGlyGlyArgLysProAlaArgLeuIleValPheProAspLeuGlyValArgValCys 
6361 AGAAGGGGGGTCGTAAGCCAGCTCGTCTCATCGTGTTCCCCGATCTGGGCGTGCGCGTGT 
TCTTCCCCCCAGCATTCGGTCGAGCAGAGTAGCACAAGGGGCTAGACCCGCACGCGCACA 

GluLysMetAlaLeuTyrAspValValThrLysLeuProLeuAlaValMetGlySerSer 
6421 GCGAAAAGATGGCTTTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGATGGGAAGCT 
CGCTTTTCTACCGAAACATGCTGCACCAATGTTTCGAGGGGAACCGGCACTACCCTTCGA 

^^TyrGlyPheGlnTyrSerProGlyGlnArgValGluPheLeuValGlnAlaTrpLysSer 
6481 CCTACGGATTCCAATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGT 
GGATGCCTAAGGTTATGAGTGGTCCTGTCGCCCAACTTAAGGAGCACGTTCGCACCTTCA 

LysLysThrProMetGlyPheSerTyrAspThrArgCysPheAspSerThrValThrGlu 
6541 CCAAGAAAACCCCAATGGGGTTCTCGTATGATACCCGCTGCTTTGACTCCACAGTCACTG 
GGTTCTTTTGGGGTTACCCCAAGAGCATACTATGGGCGACGAAACTGAGGTGTCAGTGAC 

SerAspIleArgThrGluGluAlalleTyrGlnCysCysAspLeuAspProGlnAlaArg 
6601 AGAGCGACATCCGTACGGAGGAGGCAATCTACCAATGTTGTGACCTCGACCCCCAAGCCC 
TCTCGCTGTAGGCATGCCTCCTCCGTTAGATGGTTACAACACTGGAGCTGGGGGTTCGGG 

ValAlalleLysSerLeuThrGluArgLeuTyrValGlyGlyProLeuThrAsnSerArg 
6661 GCGTGGCCATCAAGTCCCTCACCGAGAGGCTTTATGTTGGGGGCCCTCTTACCAATTCAA 
CGCACCGGTAGTTCAGGGAGTGGCTCTCCGAAATACAACCCCCGGGAGAATGGTTAAGTT 

GlyGluAsnCysGlyTyrArgArgCysArgAlaSerGlyValLeuThrThrSerCysGly 
6721 GGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTG 
CCCCCCTCTTGACGCCGATAGCGTCCACGGCGC6CTCGCCGCATGACTGTTGATCGACAC 

AsnThrLeuThrCysTyrl leLysAlaArgAlaAlaCysArgAlaAlaGlyLeuGlnAsp 
6781 GTAACACCCTCACTTGCTACATCAAGGCCCGGGCAGCCTGTCGAGCCGCAGGGCTCCAGG 
CATTGTGGGAGTGAACGATGTAGTTCCGGGCCCGTCGGACAGCTCGGCGTCCCGAGGTCC 

CysThrMetLeuValCysGlyAspAspLeuValVal I leCysGluSerAlaGlyValGln 
6841 ACTGCACCATGCTCGTGTGTGGCGACGACTTAGTCGTTATCTGTGAAAGCGCGGGGGTCC 
TGACGTGGTACGAGCACACACCGCTGCTGAATCAG.CAATAGACACTTTCGCGCCCCCAGG 

GluAspAlaAlaSerLeuArgAlaPheThrGluAlaMetThrArgTyrSerAlaProPro 
6901 AGGAGGACGCGGCGAGCCTGAGAGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCeCC 
TCCTCCTGCGCCGCTCGGACTCTCGGAAGTGCCTCCGATACTGGTCCATGAGGCGGGGGG 

Gl yAspProProG InProG 1 uTyrAspLeuG 1 uLeu 1 1 eThrSerCysSerSerAsnVa 1 
6961 CTGGGGACCCCCCACAACCAGAATACGACTTGGAGCTCATAACATCATGCTCCTCCAACG 
GACCCCTGGGGGGTGTTGGTCTTATGCTGAACCTCGAGTATTGTAGTACGAGGAGGTTGC 

„ SerValAlaHIsAspGlyAlaGlyLysArgValTyrTyrLeuThrArgAspProThrThr 

7021 TGTCAGTCGCCCACGACGGCGCTGGAAAGAG5GTCTACTACCTCACCCGTGACCCTACAA 
ACAGTCAGCGGGTGCT6CCGCGACCTTTCTCCCAGATGATGGAGTGGGCACTGGGATGTT 



FIG. 47H 

™o, PfoLeuAlaArgAlaAlaTrpGluTh^ 
7081 CCCCCCTCGCGAGAGCTGCGTGG 

GGGGGGAGCGCTCTCGACGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGACCGATC 

AsnllelleMetPheAlaProThrLeuTrpAlaArgMetlleLeuMetThrHisPhePhe 
7141 GCAACATAATCATGTTTGCCCCCACACTGTGGGCGAGGATGATACTGATGACCCATTTCT 
CGTTGTATTAGTACAAACGGGGGTGTGACACCCGCTCCTACTATGACTACTGGGTAAAGA 

Se0alLeuIleAlaArgAspGlnLeuGluGlnAlaLeuAspCysGluIleTyrGlyAla 
7201 TTAGCGTCCTTATAGCCAGGGACCAGCTTGAACAGGCCCTCGATTGCGAGATCTACGGGG 
AATCGCAGGAATATCGGTCGCTGGTCGAACTTGTCCGGGAGCTAACGCTCTAGATGCCCC 

^ CysTyrSerlleGluProLeuAspLeuProProIlelleGlnArgLeu 
7261 CCTGCTACTCCATAGAACCACTTGATCTAGCTCCAATCATTCAAAGACTC 
GGACGATGAGGTATCTTGGTGAACTAGATGGAGGTTAGTAAGTTTCTGAG 
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FIG. 54A 

, ArgSerArgAsnLeuG lyLysVa 1 1 1 eAspThrLeuThrCysG 1 yPheA 1 aAspLeuMet 
1 AGGTCGCGCAATTTG6GTAAGGTCATCGATACCCTTACGTGCG6CTTCGCC6ACCTCATG 
TCCAGCGCGTTAAACCCATTCCAGTAGCTATGGGAATGCACGCCGAAGCGGCTGGAGTAC 

a, ^TyriieProi^uy^^ 

61 GGGTACATACCGCTCGTCGGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGGCGCATGGC 
CCCATGTATGGCGAGCAGCCGCGGGGAGAACCTCCGCGACGGTCCCGGGACCGCGTACCG 

ValArgValLeuGluAspGlyValAsnTyrAlaThrGlyAsnLeuProGlyCysSerPhe 
121 GTCCGGGTTCTGGAAGACGGCGTGAACTATGCAACAGGGAACCTTCCTGGTTGCTCTTTC 
CAGGCCCAAGACCTTCTGCCGCACTTGATACGTTGTCCCTTGGAAGGACCAACGAGAAAG 

SerllePheLeuLeuAlaLeuLeuSerCysLeuThrValProAlaSerAlaTyrGlnVal 
181 TCTATCTTCCTTCTGGCCCTGCTCTCTTGCTTGACTGTGCCCGCTTCGGCCTACCAAGTG 
AGATAGAAGGAAGACCGGGACGAGAGAACGAACTGACACGGGCGAAGCCGGATGGTTCAC 

o, •■ ArgAsnSerThrGlyLeuTyrHisValThrAsnAspCysProAsnSerSerl leValTyr 
241 CGCAACTCCACGGGGCTTTACCACGTCACCAATGATTGCCCTAACTCGAGTATTGTGTAC 
GCGTTGAGGTGCCCCGAAATGGTGCAGTGGTTACTAACGGGATTGAGCTCATAACACATG 

GluAlaAlaAspAlalleLeuHisThrProGly^ 
301 GAGGCGGCCGATGCCATCCTGCACACTCCGGGGTGCGTCCCTTGCGTTCGTGAGGGCAAC 
CTCCGCCGGCTACGGTAGGACGTGTGAGGCCCCACGCAGGGAACGCAAGCACTCCCGTTG 

^ AlaSerArgCysTrpValAlaMetThrProThrValAlaThrArgAspGlyLysLeuPro 
361 GCCTCGAGGTGTTGGGTGGCGATGACCCCTACGGTGGCCACCAGGGATGGCAAACTCCCC 
CGGAGCTCCACAACCCACCGCTACTGGGGATGCCACCGGTGGTCCCTACCGTTTGAGGGG 

AlaThrGlnLeuArgArgHisIleAspLeuLeuValGlySerAlaThrLeuCysSerAla 
421 GCGACGCAGCTTCGACGTCACATCGATCTGCTTGTCGGGAGCGCCACCCTCTGTTCGGCC 
CGCTGCGTCGAAGCTGCAGTGTAGCTAGACGAACAGCCCTCGCGGTGGGAGACAAGCCGG 

f rt „ LeuTyrValGlyAspLeuCysGlySerValPheLeuValGlyGlnLeuPheThrPheSer 
481 CTCTACGTGGGGGACCTATGCGGGTCTGTCTTTCTTGTCGGCCAACTGTTCACCTTCTCT 
GAGATGCACCCCCTGGATACGCCCAGACAGAAAGAACAGCCGGTTGACAAGTGGAAGAGA 

,-, , PfoArgArgHisTrpThrThrGlnGlyCysAsnCysSerlleTyrProGlyHisIleThr 
541 CCCAGGCGCCACTGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATATAACG 
GGGTCCGCGGTGACCTGCTCCGTTCCAACGTTAACGAGATAGATAGGGCCGGTATATTGC 

^ GlyHi sArgMetA 1 aTrpAspMetMetMetAsnTrpSerProTh rTh rA 1 aLe uVa 1 Met 
601 GGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTGGTAATG 
CCAGTGGCGTACCGTACCCTATACTACTACTTGACCAGGGGATGCTGCCGCAACCATTAC 
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AlaGlnLeuLeuArglleProGlnAlalleLeuAspMetlleAlaGlyAlaHisTrpGly 
661 GCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTGGGGA 
CGAGTCGACGAGGCCTAGGGTGTTCGGTAGAACCTGTACTAGCGACCACGAGJGACCCCT 

ValLeuAlaGlylleAlaTyrPheSerMetValGlyAsnTrpAlaLysValLeuValVal 
721 GTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGTAGTG 
CAGGACCGCCCGTATCGCATAAAGAGGTACCACCCCTTGACCCGCTTCCAGGACCATCAC 

LeuLeuLeuPheAlaGlyValAspAlaGluThrHisValThrGlyGlySerAlaGlyHis 
781 CTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACCGGGGGAAGTGCCGGCCAC , 
GACGACGATAAACGGCCGCAGCTGCGCCTTTGGGTGCAGTGGCCCCCTTCACGGCCGGTG 

ThrValSerGlyPheValSerLeuLeuAlaProGlyAlaLysGlnAsnValGlnLeuIle 
Bm ACTGTGTCTGGATTTGTTAGCCTCCTCGCACCAGGCGCCAAGCAGAACGTCCAGCTGATC 
TGACACAGACCTAAACAATCGGAGGAGCGTGGTCCGCGGTTCGTCTTGCAGGTCGACTAG 

AsnThrAsnGlySerTrpHisLeuAsnSerThrAlaLeuAsnCysAsnAspSerLeuAsn 
901 AACACCAACGGCAGTTGGCACCTCAATAGCACGGCCCTGAACTGCAATGATAGCCTCAAC 
TTGTGGTTGCCGTCAACCGTGGAGTTATCGTGCCGGGACTTGACGTTACTATCGGAGTTG 

ThrGlyTrpLeuAlaGlyLeuPheTyrHisHisLysPheAsnSerSerGlyCysProGlu 
961 ACCGGCTGGTTGGCAGGGCTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCCTGAG 
TGGCCGACCAACCGTCCCGAAAAGATAGTGGTGTTCAAGTTGAGAAGTCCGACAGGACTC 

ArgLeuAlaSerCysArgProLeuThrAspPheAspGlnGlyTrpGlyProI leSerTyr 
1021 AGGCTAGCCAGCTGCCGACCCCTTACCGATTTTGACCAGGGCTGGGGCCCTATCAGTTAT 
TCCGATCGGTCGACGGCTGGGGAATGGCTAAAACTGGTCCCGACCCCGGGATAGTCAATA 

AlaAsnGlySerGlyProAspGlnArgProTyrCysTrpHisTyrProProLysProCys 
1081 GCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACCCCCCAAAACCTTGC 
CGGTTGCCTTCGCCGGGGCTGGTCGCGGGGATGACGACCGTGATGGGGGGTTTTGGAACG 

GlylleValProAlaLysSerValCysGlyProValTyrCysPheThrProSerProVal 
1141 GGTATTGTGCCCGCGAAGAGTGTGTGTGGTCCGGTATATTGCTTCACTCCCAGCCCCGTG 
CCATAACACGGGCGCTTCTCACACACACCAGGCCATATAACGAAGTGAGGGTCGGGGCAC 

ValValGlyThrThrAspArgSerGlyAlaProThrTyrSerTrpGlyGluAsnAspThr 
1201 GTG6TGGGAACGACCGACAGGTCGGGCGCGCCCACCTACAGCTGGGGTGAAAATGATACG 
CACCACCCTTGCTGGCTGTCCAGCCCGCGCGGGTGGATGTCGACCCCACTTTTACTATGC 

AspValPheValLeuAsnAsnThrArgProProLeuGlyAsnTrpPheGlyCysThrTrp 
1261 GACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGTTCGGTTGTACCTGG 
CTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCGACCCGTTAACCAAGCCAACATGGACC 

MetAsnSerThrG lyPheThrLysVa 1 CysG lyA 1 aProProCysVa 1 1 leG 1 yG lyA 1 a 
1321 ATGAACTCAACTGGATTCACCAAAGTGTGCGGAGCGCCTCCTTGTGTCATCGGAGGGGCG 
TACTTGAGTTGACCTAAGTGGTTTCACACGCCTCGCGGAGGAACACAGTAGCCTCCCCGC 

GlyAsnAsnThrLeuHisCysProThrAspCysPheArgLysHisProAspAlaThrTyr 
1381 GGCAACAACACCCTGCACTGCCCCACTGATTGCTTCCGCAAGCATCCGGACGCCACATAC 
CCGTTGTTGTGGGACGTGACGGGGTGACTAACGAAGGCGTTCGTAGGCCTGCGGTGTATG 

SerArgCysGlySerGlyProTrpIleThrProArgCysLeuValAspTyrProTyrArg 
mm TCTCGGTGCGGCTCCGGTCCCTGGATCACACCCAG6TGCCTGGTCGACTACCCGTATAGG 
AGAGCCACGCCGAGGCCAGGGACCTAGTGTGGGTCCACGGACCAGCTGATGGGCATATCC 

LeuTrpHisTyrProCysThrlleAsnTyrThrllePheLysIleArgMetTyrValGly 
1501 CTTTGGCATTATCCTTGTACCATCAACTACACCATATTTAAAATCAGGATGTACGTGGGA 
GAAACCGTAATAGGAACATGGTAGTTGATGTGGTATAAATTTTAGTCCTACATGCACCCT 

GlyValGluHisArgLeuGluAlaAlaCysAsnTrpThrArgGlyGluArgCysAspLeu 
1561 GGGGTCGAACACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCGAACGTTGCGATCTG 
CCCCAGCTTGTGTCCGACCTTCGACGGACGTTGACCTGCGCCCCGCTTGCAACGCTAGAC 

GluAspArgAspArgSerGluLeuSerProLeuLeuLeuThrThrThrGlnTrpGlnVal 
1621 GAAGACAGGGACAGGTCCGAGCTCAGCCCGTTACTGCTGACCACTACACAGTGGCAGGTC 
CTTCTGTCCCTGTCCAGGCTCGAGTCGGGCAATGACGACTG6TGATGTGTCACCGTCCAG 



( 



FIG. 54C 

, „ n , LeuProCysSerPheThrThrLeuProA 1 aLeuSe rTh rG 1 yLeu 1 1 eH I sLeuH i sG 1 n 
1681 CTCCCGTGnCCnCACAACCCTACCAGCCTTGTCCACCGGCCTCATCCACCTCCACCAG 
GAGGGCACAAGGAAGTGTTGGGATGGTCGGAACAGGTGGCCGGAGTAGGTGGAGGTGGTC 

,„ , AsnlleValAspValGlnTyrLeuTyrGlyValGlySerSerlleAlaSerTrpAlalle 
1741 AACATTGTGGACGTGCAGTACTTGTACGGGGTGGGGTCAAGCATCGCGTCCTGGGCCATT 
TTGTAACACCTGCACGTCATGAACATGCCCCACCCCAGTTCGTAGCGCAGGACCCGGTAA 

LysTrpG 1 uTy rVa 1 Va 1 LeuLeuPheLeuLeuLeuA 1 a AspA 1 a A f gVa 1 CysSe rCys 
1801 AA6TGGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTGC 
TTCACCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGACG 

, , LeyI r P MetMetLeuLeu H eSe rG I nA 1 aG 1 uA 1 aA 1 aLeuG 1 uAsnLeuVa 1 1 1 eLeu 
1861 TTGTGGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGAACCTCGTAATACTT 
AACACCTACTACGATGAGTATAGGGTTCGCCTCCGCCGAAACCTCTTGGAGCATTATGAA 

„ AsnA 1 aA 1 aSe rLeuA 1 aG 1 yTh rH i sG 1 yLeuVa 1 Se r PheLeuVa 1 PhePheCy sPhe 
1921 AATGCAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTTT 
TTACGTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGCACAAGAAGACGAAA 

AiaTrpTyrLeuLysGiy^ 

1981 GCATGGTATTTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCTTCTACGGGATGTGG 
CGTACCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGAAGATGCCCTACACC 

on,, ProLeuLeuLeuLeuLeuLeuAlaLeuProGlnArgAlaTyrAla 
2041 CCTCTCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGTG 
GGAGAGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCGACCTGTGCCTCCAC 

AlaAlaSerCysGlyGlyValValLeuValGlyLeuMetAl-aLeuThrLeuSerProTyr 
2101 GCCGC6TCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATAT 
CGGCGCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATA 

o -, ^ , Jy^LYsArgTyr I leSe rTrpCysLeuTr pTrpLeuG 1 nTy rPheLeuTh rA rgVa 1 G 1 u 
2161 TACAAGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGAA 
ATGTTCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAGACTGGTCTCACCTT 

wn , A 1 aG 1 nLeuH i sVa 1 Trp IleProP roLeuAsnVa lArgGlyGl yA rgAspA 1 aVa 1 1 1 e 
2221 GCGCAACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCATC 
CGCGTTGACGTGCACACCTAAGG6GGGGAGTTGCAGGCTCCCCCCGCGCTGCGGCAGTAG 

nnn LeuLeuMetCysAlaValHisProThrLeuValPheAspIleThrLysLeuLeuLeuAla 
2281 TTACTCATGTGTGCTGTACACCCGACTCTGGTATTTGACATCACCAAATTGCTGCTGGCC 
AATGAGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGTTTAACGACGACCGG 

0 , , , Va 1 PheG 1 yProLeuTrpI 1 eLeuG 1 nA 1 aSerLeuLeuLysVa 1 ProTy rPheVa 1 A rg 
2341 GTCTTCGGACCCCTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCGC 
CAGAAGCCTGGGGAAACCTAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGCG 

o, rt , ValGln61yLeuLeuArgPheCysAlaLeuAlaArgLysMetIleGlyGlyHisT^ 
2401 GTCCAAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGTG 
CAGGTTCCGGAAGAGGCCAAGACGCGCAATCGCGCCTTCTACTAGCCTCCGGTAATGCAC 

„ ri GlnMetValllelleLysLeuGlyAlaLeuThrGlyThrTyrValTyrAsnHisLeuThr 
2461 CAAATGGTCATCATTAAGTTAGGGGCGCTTACTGGCACCTATGTTTATAACCATCTCACT 
GTTTACCAGTAGTAATTCAATCCCCGCGAATGACCGTGGATACAAATATTGGTAGAGTGA 

ncn , P™LeuAr gAspTrpA 1 aH i sAsnG 1 yLeuA rgAspLeuA 1 a Va 1 A 1 aVa 1 G 1 uP roVa 1 
2521 CCTCTTCGGGACTGGGCGCACAACCGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGTC 
GGAGAAGCCCTGACCCGCGTGTTGCCGAACGCTCTAGACCGGCACCGACATCTCGGTCAG 

CAGAAGAGGGTTTACCTCTGGTTCGAGTAGTGCACCCCCCGTCTATGGCGGCGCAGGCCA 

„ , , Asp 1 1 e 1 1 eAsnG 1 yLeuProVa 1 Se rA 1 aA rgA rgG 1 yA rgG 1 u 1 1 eLeuLeuG 1 yP ro 
2641 GACATCATCAACGGCTTGCCTGmCCGCCCGCAGGGGCCGGG 

CTGTAGTAGTTGCCGAACGGAeAAAGGCGGGCGTCCCCGGCCCTCTATGACGAGCCC6GT 
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„,~, MaAspG 1 yMetVa 1 Se rLysG 1 yTrpArgLeuLeuA 1 aPro I leTh r A 1 aTy rA 1 aG 1 n 
2701 6CCGATGGAATGGTCTCCAAG6GGTGGAGGTTGCTGGC 

CGGCTACCTTACCAGAGGTTCCCCACCTCCAACGACCGCGGGTAGTGGCGCATGCGGGTC 

„ c , 61 njh rA rgG lyLeuLeuG lyCys 1 1 e 1 1 eThrSe rLeuTh rG lyA rgAspLysAsnG 1 n 
2761 CAGACAAGGGGCCTCCTAGGGTGCATAATCACCAGCCTAACTGGCCGGGAC^ 

GTCTGTTCCCCGGAGGATCCCACGTATTAGTGGTCGGATTGACCGGCCCTGTTTTTGGTT 

ooo i 1 y^lyG 1 uValG 1 n I leVa 1 Se rTh rA 1 aA 1 aG 1 nThrPheLeuA 1 aThrCys 1 1 e 
2821 GTGGAGGGTGAGGTCCAGATTGTGT 

CACCTCCCACTCCAGGTCTAACACAGTTGAC6ACGGGTTTGGAAGGACCGTTGCACGTAG 

AsnGlyValCysTrpThrValTyrHisGlyAlaGlyThrArgThrlleAlaSerProLys 
2881 MTG6GGTGTGCTG6ACTGTCTACCACGGBGCCG 

TTACCCCACACGACCTGACAGATGGTGCCCCGGCCTTGCTCCTGGTAGCGCAGTGGGTTC 

oo * , ^jl^Proyijl lelG 1 nMetTy rThr AsnVa 1 AspG 1 nAspLeuVa 1 G 1 yTrpProA 1 aPro 
2941 GGTCCTGTCATCCAGATGW 

CCAGGACAGTAGGTCTACATATGGTTACATCTGGTTCTGGAACACCCGACCGGGCGAGGC 

Tnni ^l^M^f^rUyTDCProCysThrCysGlySerSerAspLeuTyrLeuValThr 
3001 CAAGGTAGCCGCTCATTGACACCCTGCACTTGCGGCT 

GTTCCATCGGCGAGTAACTGTGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTGC 

m . , ArgH I s A 1 aAspVa 1 1 leProVa 1 A rg A rgArgG 1 yAspSerArgG 1 ySe rLeuLeuSer 

3i2i 

GGGGCCGGGTAAAGGATGAACTTTCCGAGGAGCCCCCCAGGCGACAACACGGGGCGCCCC 

* 1$ n y A ^AA^X^A^XRf 1 CysThrA rgG 1 yVa 1 A 1 aLysA 1 aVa 1 Asp 
3181 CACGCCGTGGGCATATTTAGGGCW 

GTGCGGCACCCGTATAAATCCCGGCGCCACACGTGGGCACCTCACCGATTCCGCCACCTG 

3241 IIIM^PT^I^^^^T^^^MP CATGAG ^ TCCCCGGTG TTCACGGATAACTCC 
AAATAGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACAAGTGCCTATTGAGG 

Se r Pr oProVa 1 Va 1 ProG 1 nSe rPheG 1 nVa 1 A 1 aH isLeuH i sA 1 aP roTh rG 1 ySe r 
3301 !£ICCACCAGTAGTGCCCCA^ 

AGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGGGTGTCCGTCG 

___ . G lyLysSerThrLys Va 1 ProA 1 aA 1 aTy rA laA 1 aG 1 nG lyTyrLysVa 1 LeuVa 1 Leu 
3361 GGCAAAAGCACCAAGGTCCW 

CCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATATTCCACGATCATGAG 

A snPr oSe rVal A 1 a A 1 ajh rteuG 1 y PheG 1 yA 1 aTy rMetSerLy sA 1 aH i sG 1 y 1 1 e 
3421 AACCCCTCTGTTGCTGCAACACT^ 

TTGGGGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGTTCCGAGTACCCTAG 

xn»i ^^^U%^9J\i r M^l^W l leThrThrGlySerProI leThrTyrSer 
CTAGGATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGGGGTAGTGCATGAGG 

,r-, , Tt3j:Iyj: G JlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyrAspI lei lei le 

TGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAATACTGTATTATTAA 

, cn , CysAspG 1 uCysH i sSe rThr AspA 1 aTh rSe r 1 1 eLeuG ly 1 1 eG 1 yTh rVa 1 LeuAsp 
3601 TGTGACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGU 

ACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAGCCGTGACAGGAACTG 
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G 1 n A 1 aG 1 uTh r A 1 aG 1 y A 1 aAr gLeuVa 1 Va 1 Leu A 1 aTh r A 1 aTh rP roP roG 1 y Se r 
3661 CAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCCTCCGGGCTCC 
GTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGGAGGCCCGAGG 

ValThrValProHisProAsnl leGluGluValAlaLeuSerThrThrGlyGluI lePro 
3721 GTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAGATCCCT 
CAGTGACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCCTCTCTAGGGA 

PheTyrGlyLysAlalleProLeuGluVallleLysGlyGlyArgHisLeuIlePheCys 
3781 TTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTCTGT 
AAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGAGTAGAAGACA 

HisSerLysLysLysCysAspGluLeuAlaAlaLysLeuValAlaLeuGlylleAsnAla 
3841 CATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGGCATCAATGCC 
GTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTAACCCGTAGTTACGG 

Va 1 A 1 aTy r Ty r A r gG 1 y Le u As pVa 1 Se rVa 1 1 1 e ProTh r Se rG 1 y A s pVa 1 Va 1 Va 1 
3901 GTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGATGTTGTCGTC 
CACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCTACAACAGCAG 

ValAlaThrAspAlaLeuMetThrGlyTyrThrGlyAspPheAspSerVallleAspCys 
3961 GTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGTGATAGACTGC 
CACCGTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCACTATCTGACG 

AsnThrCysValThrGlnThrValAspPheSerLeuAspProThrPheThrlleGluThr 
4021 AATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCTTCACCATTGAGACA 
TTATGCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTGT 

IleThrLeuProGlnAspAlaValSerArgThrGlnArgArgGlyArgThrGlyArgGly 
4081 ATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTC6GGGCAGGACTGGCAGGGGG 
TAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCCTGACCGTCCCCC 

LysProGlylleTyrArgPheValAlaProGlyGluArgProSerGlyMetPheAspSer 
4141 AAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGCATGTTCGACTCG 
TTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCGTACAAGCTGAGC 

SerValLeuCysGluCysTyrAspAlaGlyCysAlaTrpTyrGluLeuThrProAlaGlu 
4201 TCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTCACGCCCGCCGAG 
AGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAGTGCGGGCGGCTG 

ThrThrValArgLeuArgAlaTyrMetAsnThrProGlyLeuProValCysGlnAspHis 
4261 ACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTGTGCCAGGACCAT 
TGATGTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGCACACGGTCCTGGTA 

LeuG 1 uPheTrpG 1 uG 1 y Va 1 PheThrG 1 yLeuThrH i s 1 1 e AspA 1 aH i sPheLeuSe r 
4321 CTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCCCACTTTCTATCC 
GAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGGGTGAAAGATAGG 

G 1 nTh r LysG 1 nSe rG 1 yG 1 u AsnLeuP r oTy r LeuVa 1 A 1 aTy rG 1 nA 1 aTh rVa 1 Cy s 
4381 CAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTGC 
GTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTTCGGTGGCACACG 

AlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCysLeuIleArgLeu 
4441 GCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCGCCTC 
CGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACAAACTAAGCGGAG 

LysProThrLeuH i sG 1 yProThrProLeuLeuTy rArgLeuG 1 yA 1 aVa 1 G 1 nAsnG 1 u 
4501 AAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTTCAGAATGAA 
TTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGACAAGTCTTACTT 

1 1 eTh rLeuThr H i sProVa 1 Th r LysTy r 1 1 eMetTh rCy sMe tSe rA 1 a AspLeuG 1 u 
4561 ATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCGGCCGACCTGGAG 
TAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGCCGGCTGGACCTC 

ValValThrSerThrTrpValLeuValGlyGlyValLeuAlaAlaLeuAlaAlaTyrCys 
4621 GTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTGC 
CAGCAGTGCTCGTGGACCCACGA6CAACCGCCGCAGGACCGACGAAACCGGCGCATAACG 
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LeuSe rTh rG 1 y Cys Va 1 Va 1 1 leVa 16 1 yA rgVa 1 Va 1 LeuSe rG 1 y Ly sProA 1 a 1 1 e 
4681 CTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAATC 
GACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTAG 

IleProAspArgGluValLeuTyrArgGluPheAspGluMetGluGluCysSerGlnHis 
4741 ATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCAC 
TATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGTG 

LeuProTy r I leG 1 uG 1 nG 1 yMetMetLeuA 1 aG 1 uG 1 nPheLy sG 1 nLysA 1 aLeuG ly 
4801 TTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGGC 
AATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCG 

LeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaProAlaValGlnThrAspTrp 
4861 CTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTGG 
GAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACC 

GlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhel leSerGlyl leGlnTyr 
4921 CAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATAC 
GTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATG 

LeuAlaGlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeuMetAlaPheThr 
4981 TTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTACA 
AACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGT 

AlaAlaValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsnlleLeuGlyGly 
5041 GCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGGG 
CGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATAACCCCCCC 

TrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheValGlyAlaGlyLeu 
5101 TGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTTA 
ACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAAT 

AlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAspIleLeuAlaGly 
5161 GCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGGG 
CGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGGAACGTCCC 

Ty rG 1 .yA 1 aG 1 y Va 1 A 1 aG 1 yA 1 aLeuVa 1 A 1 aPheLys 1 1 eMetSe rG 1 yG 1 uVa 1 P ro 
5221 TATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCCC 
ATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCACTCCAGGGG 

SerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGlyAlaLeuValVal 
5281 TCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCCTCGTAGTC 
AGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGGAGCATCAG 

GlyValValCysAlaAlalleLeuArgARgHisValGlyProGlyGluGlyAlaValGln 
5341 GGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGGCAGTGCAG 
CCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCCGTCACGTC 

TrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSerProThrHisTyr 
5401 TGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTAC 
ACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGATG 

ValProGluSerAspAlaAlaAlaArgValThrAlalleLeuSerSerLeuThrValThr 
5461 GTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTGTAACC 
CACGGCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGTCGGAGTGACATTGG 

GlnLeuLeuArgArgLeuHisGlnTrpIleSerSerGluCysThrThrProCysSerGly 
5521 CAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCACTCCATGCTCCGGT 
GTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCCA 

SerTrpLeuArgAspI leTrpAspTrpI leCysGluValLeuSerAspPheLysThrTrp 
5581 TCCTGGCTAAGGGAGATCTGGGACTGGATATGCGAGGTGTTGAGCGACTTTAAGACCTGG 
AGGACCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGACC 

LeuLy s A 1 aLy sLeuMe tProG 1 nLeuP roG 1 y 1 1 eP roPheVa 1 Se r Cy sG 1 nA rgG 1 y 
5641 CTAAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCTTTGTGTCCTGCCAGCGCGGG 
GATTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACAGGAC6GTCGC6CCC 
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TyrLysGlyValTrpArgValAspGlylleMetHisThrArgCysHisCysGlyAlaGlu 
5701 TATAAGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCGCTGCCACTGTGGAGCTGAG 
ATATTCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCGACGGTGACACCTCGACTC 

HeThrGlyHisValLysAsnGly™^ 
5761 ATCACTGGACATGTCAAAAACGGGACGATGAGGATCGTCGGTCCTAGGACCTGCAGGAAC 
TAGTGACCTGTACAGTTTTTGCCCTGCTACTCCTAGCAGCCAGGATCCTGGACGTCCTTG 

MetTrpSerGlyThrPheProIleAsnAlaTyrThrThrGlyProCysThrProLeuPro 
5821 ATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCTGTACCCCCCTTCCT 
TACACCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCCCGGGGACATGGGGGGAAGGA 

AlaProAsnTyrThrPheAlaLeuTrpArgValSerAlaGluGluTyrValGluIleArg 
5881 GCGCCGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAGAGGAATATGTGGAGATAAGG 
CGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTATACACCTCTATTCC 

GlnValGlyAspPheHisTyrValThrGlyMetThrThrAspAsnLeuLysCysProCys 
5941 CAGGTGGGGGACTTCCACTACGTGACGGGTATGACTACTGACAATCTCAAATGCCCGTGC 
GTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAGAGTTTACGGGCACG 

G 1 nVa 1 ProSerProG 1 uPhePheTh rG 1 uLeuAspG 1 yVa 1 ArgLeuH i sA rgPheA la 
6001 CAGGTCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCCTACATAGGTTTGCG 
GTCCAGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGCGGATGTATCCAAACGC 

ProProCysLysProLeuLeuArgGluGluValSerPheArgValGlyLeuHisGluTyr 
6061 CCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGGACTCCACGAATAC 
GGGGGGACGTTCGGGAACGACGCCCTCCTCCATAGTAAGTCTCATCCTGAGGTGCTTATG 

_ ProValGlySerGlnLeuProCysGluProGluProAspValAlaValLeuThrSerMet 
6121 CCGGTAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCGTGTTGACGTCCATG 
GGCCATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCTGCACCGGCACAACTGCAGGTAC 

LeuThrAspProSerHisIleThrAlaGluAlaAlaGlyArgArgLeuAlaArgGlySer 
6181 CTCACTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGTTGGCGAGGGGATCA 
GAGTGACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCAACCGCTCCCCTAGT 

ProProSerValAlaSerSerSerAlaSerGlnLeuSerAlaProSerLeuLysAlaThr 
6241 CCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGCAACT 
GGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCGTTGA 

CysThrAlaAsnHisAspSerProAspAlaGluLeuIleGluAlaAsnLeuLeuTrpArg 
6301 TGCACCGCTAACCATGACTCCCCTGATGCTGAGCTCATAGAGGCCAACCTCCTATGGAGG 
ACGTGGCGATTGGTACTGAGGGGACTACGACTCGAGTATCTCCGGTTGGAGGATACCTCC 

GlnGluMetGlyGlyAsnlleThrArgValGluSerGluAsnLysValVallleLeuAsp 
6361 CAGGAGATGGGCGGCAACATCACCAGGGTTGAGTCAGAAAACAAAGTGGTGATTCTGGAC 
GTCCTCTACCCGCCGTTGTAGTGGTCCCAACTCAGTCTTTTGTTTCACCACTAAGACCTG 

SerPheAspProLeuValAlaGluGluAspGluArgGluI leSerValProAlaGluI le 
6421 TCCTTCGATCCGCTTGTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAAATC 
AGGAAGCTAGGCGAACACCGCCTCCTCGTGCTCGCCCTCTAGAGGCATGGGCGTCTTTAG 

LeuArgLysSerArgArgPheAlaGlnAlaLeuProValTrpAlaArgProAspTyrAsn 
6481 CTGCGGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGACTATAAC 
GACGCCTTCAGAGCCTCTAAGCGGGTCCGGGACGGGCAAACCCGCGCCGGCCTGATATTG 

£n9£n2k$yy a 1 G 1 uTh rTrpLysLy sProA spTy rG 1 uProProVa 1 Va 1 H i sG 1 y Cys 
6541 CCCCCGCTAGTGGAGAC6TGGAAAAAGCCCGACTACGAACCACCTGTGGTCCATGGGTGT 
GGGGGCGATCACCTCTGCACCTTTTTCG6GCTGATGCTTGGTGGACACCAGGTACCGACA 

ProLeuProProProLysSerProProValProProProArgLysLysArgThrValVal 
6601 CCGCTTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGCCTCGGAAGAAGCGGACGGTGGTC 
GGCGAAGGTGGAGGTTTCAGGGGAGGACACGGAGGCGGAGCCTTCTTCGCCTGCCACCAG 

LeuTh rG 1 uSerTh rLeuSe rTh rA 1 aLeuA 1 aG 1 uLeuA 1 aTh rA rgSe rPheG 1 ySer 
6661 CTCACT6AATCAACCCTATCTACTGCCTTGGCCGAGCTCGCCACCAGfiAGCTTTGGCAGC 
GAGTGACTTAGTTGGGATAGATGACGGAACCGGCTCGAGCGGTGGTCTTCGAAACCGTCG 
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Se rSerTti rSerGlyll eTh r6 1 yAspAsnTh rTh rTh rSe r Se rG 1 uP roA 1 aP roSe r 
6721 TCCTCAACTTCCGGCATTACGGGCGACAATACGACAACATCCTCTGAGCCCGCCCCTTCT 
AGGAGTTGAAGGCCGTAATGCCCGCTGTTATGCTGTTGTAGGAGACTCGGGCGGGGAAGA 

GlyCysProProAspSerAspAlaGluSerTyrSerSerMetProProLeuGluGlyGlu 
6781 GGCTGCGCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCCCCCTGGAGGGGGAG 
CCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGGGGGACCTCCCCGTC 

ProGlyAspProAspLeuSerAspGlySerTrpSerThrValSerSerGluAlaAsnAla 
6841 CCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACGGTCAGTAGTGAGGCCAACGCG 
GGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGCCAGTCATCACTCCGGTTGCGC 

G 1 uAspVa 1 Va 1 CysCysSe rMetSe rTy rSe rTrpThrG lyA laLeuVa ITh rP roCy s 
6901 GAGGATGTCGTGTGCTGCTCAATGTCTTACTCTTGGACAGGCGCACTCGTCACCCCGTGC 
CTCCTACAGCACACGACGAGTTACAGAATGAGAACCTGTCCGCGTGAGCAGTGGGGCACG 

AlaAlaGluGluGlnLysLeuProIleAsnAlaLeuSerAsnSerLeuLeuArgHisHis 
6961 GCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGTTGCTACGTCACCAC 
CGGCGCCTTCTTGTCTTTGACGGGTAGTTACGTGATTCGTTGAGCAACGATGCAGTGGTG 

AsnLeuValTyrSerThrThrSerArgSerAlaCysGlnArgGlnLysLysValThrPhe 
7021 AATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAGAAGAAAGTCACATTT 
TTAAACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCTTCTTTCAGTGTAAA 

AspARgLeuG 1 nVa 1 LeuAspSe rH i sTy rG 1 nAspVa 1 LeuLysG 1 uVa 1 LysA 1 aA 1 a 
7081 GACAGACTGCAAGTTCTGGACAGCCATTACCAGGACGTACTCAAGGAGGTTAAAGCAGCG 
CTGTCTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATGAGTTCCTCCAATTTCGTCGC 

AlaSerLysValLysAlaAsnLeuLeuSerValGluGluAlaCysSerLeuThrProPro 
7141 GCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGGAAGCTTGCAGCCTGACGCCCCCA 
CGCAGTTTTCACTTCCGATTGAACGATAGGCATCTCCTTCGAACGTCGGACTGCGGGGGT 

HisSerAlaLysSerLysPheGlyTyrGlyAlaLysAspValArgCysHisAlaArgLys 
7201 CACTCAGCCAAATCCAAGTTTGGTTATGGGGCAAAAGACGTCCGTTGCCATGCCAGAAAG 
GTGAGTCGGTTTAGGTTCAAACCAATACCCCGTTTTCTGCAGGCAACGGTACGGTCTTTC 

AlaValThrHisIleAsnSerValTrpLysAspLeuLeuGluAspAsnValThrProIle 
7261 GCCGTAACCCACATCAACTCCGTGTGGAAAGACCTTCTGGAAGACAATGTAACACCAATA 
CGGCATTGGGTGTAGTTGAGGCACACCTTTCTGGAAGACCTTCTGTTACATTGTGGTTAT 

AspThrThrlleMetAlaLysAsnGluValPheCysValGlnProGluLysGlyGlyArg 
7321 GACACTACCATCATGGCTAAGAACGAGGTTTTCTGCGTTCAGCCTGAGAAGGGGGGTCGT 
CTGTGATGGTAGTACCGATTCTTGCTCCAAAAGACGCAAGTCGGACTCTTCCCCCCAGCA 

LysProA laA rgLeu I leva lPheProAspLeuG lyVa 1 A rgVa 1 CysG 1 uLysMetA la 
7381 AAGCCAGCTCGTCTCATCGTGTTCCCCGATCTGGGCGTGCGCGTGTGCGAAAAGATGGCT 
TTCGGTCGAGCAGAGTAGCACAAGGGGCTAGACCCGCACGCGCACACGCTTTTCTACCGA 

LeuTyrAspValValThrLysLeuProLeuAlaValMetGlySerSerTyrGlyPheGln 
7441 TTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGATGGGAAGCTCCTACGGATTCCAA 
AACATGCTGCACCAATGTTTCGAGGGGAACCGGCACTACCCTTCGAGGATGCCTAAGGTT 

TyrSerProGlyGlnArgValGluPheLeuValGlnAlaTrpLysSerLysLysThrPro 
7501 TACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGTCCAAGAAAACCCCA 
ATGAGTGGTCCTGTCGCCCAACTTAAGGAGCACGTTCGCACCTTCAGGTTCTTTTGGGGT 

MetGlyPheSerTyrAspThrArgCysPheAspSerThrValThrGluSerAsplleArg 
7561 ATGGGGmTCGTATGATACCCGCTGCmGAaCCACAGTCACTGAGAGCGACATCCGT 
TACCCCAAGAGCATACTATGGGCGACGAAACTGAGGTGTCAGTGACTCTCGCTGTAGGCA 

ThrGluGluAlalleTyrGlnCysCysAspLeuAspProGlnAlaArgValAlalleLys 
7621 ACGGAGGAGGCAATCTACCAATGTTGTGACCTCGACCCCCAAGCCCGCGTGGCCATCAAG 
TGCCTCCTCCGTTAGATGGTTACAACACTGGAGCTGGGGGTTCGG6CGCACCGGTAGTTC 

SerLeuTh rG 1 uA rgLeuTy rVa 1 G 1 yG 1 yProLeuTh r AsnSe r A rgG 1 yG 1 uAsnCys 
7681 TCCCTCACCGAGAGGCTTTATGTTGGGGGCCCTCTTACCAATTCAAGGGGGGAGAACTGC 
AGGGAGTGGCTCTCCGAAATACAACCCCCGGGAGAATGGTTAAGTTCCCCCCTCTTGACG 
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„ , , GlvTyrArgArgCysArgAlaSe rG 1 yVa 1 LeuTh rTh rSe rCy sG 1 yAsnTh rLeuTh r 
7741 GGCTATCGCAGGTGCCGCGCGAGCGG 

CCGATAGCGTCCACGGCGCGCTCGCCGCATGACTGTTGATCGACACCATTGTGGGAGTGA 

7oni CysTyrUeLysAlaArgAlaAl^^ 

7801 TGCTACATCAAGGCCCGGGCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCTC 
ACGATGTAGTTCCGGGCCCGTCGGACAGCTCGGCGTCCCGAGGTCCTGACGTGGTACGAG 

^ ocl Va 1 CysG 1 yAspAspLeuVa 1 Va 1 1 1 eCysG 1 uSe r A laG 1 yVa 1 G 1 nG 1 uAspA laA la 
7861 GTGTGTGGCGACGACTTAGTCGTm 

CACACACCGCTGCTGAATCAGCAATAGACACTTTCGCGCCCCCAGGTCCTCCTGCGCCGC 

SerLeuArg/UaP|^ 
7921 AGCCTGAGAGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCCCCCTGGGGACCCCCCA 
TCGGACTCTCGGAAGTGCCTCCGATACTGGTCCATGAGGCGGGGGGGACCCCTGGGGGGT 

7noi G 1 nProG 1 uTy r AspLeuG 1 uLeuI 1 eTh rSe rCysSe rSer AsnVa lSerVa lAlaHis 
7981 CAACCAGAATACGACTTGGAGCTCATAACATCATGCTCCTCCAACGTGTCAGTCGCCCAC 
GTTGGTCTTATGCTGAACCTCGAGTATTGTAGTACGAGGAGGTTGCACAGTCAGCGGGTG 

on, i AspGlyAlaGlyLysArgVamrTyrLeuThrAr^ 

8041 GACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAGA 
CTGCCGCGACCTTTCTCCCAGATGATGGAGTGGGCACTGGGATGTTGGGGGGAGCGCTCT 

AlaAlaTrPGluThrAlaArgHisThrProValAsnSerTrpLeuG 
8101 GCTGCGTGGGAGACAGCAAGAC^ 

CGACGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGACCGATCCGTTGTATTAGTAC 

„ , r , PheA laProTh rLeuTrpA 1 aA rgMet 1 1 eLeuMetTh r H i sPhePheSe rVa 1 Leu 1 1 e 
8161 TTTGCCCCCACACTGTGGGCGAGGATGATACTGATGACCCATTTCTTTAGCGTCCTTATA 
AAACGGGGGTGTGACACCCGCTCCTACTATGACTACTGGGTAAAGAAATCGCAGGAATAT 

„ 0 „, AlaArgAspGlnLeuGluGlnAlaLeuAspCysGluIleTyrGlyAlaCysTyrSerlle 
8221 GCCAGGGACCAGCTTGAACAGGCCCTCGATTGCGAGATCTACGGGGCCTGCTACTCCATA 
CGGTCCCTGGTCGAACTTGTCCGGGAGCTAACGCTCTAGATGCCCCGGACGATGAGGTAT 

nnn ^ GluProLeuAspLeuProProI lei leGlnArgLeu 
8281 GAACCACTTGATCTACCTCCAATCATTCAAAGACTC 
CTTGGTGAACTAGATGGAGGTTAGTAAGTTTCTGAG 
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FIG. 62A 

CACTCCACCATGAATCACTCCCCTGT6AGGAACTACTGTCTTCACGCAGAAAGCGTCTAG 
CCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGAGAGCCATA 
GTGGTCTGCGGAACCGGTGAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGA 
TCAACCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGACTGCTAGCCGAGTAGT 
GTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAG-300 

—(Putative initiator methionine codon) 
GTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAAAAAAAACAAACGTAA 
CACCAACCGTCGCCCACAGGACGTCAAGTTCCCGGGTGGCGGTCAGATCGTTGGTGGAGT 
TTACTTGTTGCCGCGCAGGGGCCCTAGATTGGGTGTGCGCGCGACGAGAAAGACTTCCGA 
GCGGTCGCAACCTCGAGGTAGACGTCAGCCTATCCCCAAGGCTCGTCGGCCCGAGGGCAG 
GACCTGGGCTCAGCCCGGGTACCCTTGGCCCCTCTATGGCAATGAGGGCTGCGGGTGGGC-600 
GGGATGGCTCCTGTCTCCCCGTGGCTCTCGGCCTAGCTGGGGCCCCACAGACCCCCGGCG 
TAGGTCGCGCAATTTGGGTAAGGTCATCGATACCCTTACGTGCGGCTTCGCCGACCTCAT 
GGGGTACATACCGCTCGTCGGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGGCGCATGG 
CGTCCGGGTTCTGGAAGACGGCGTGAACTATGCAACAGGGAACCTTCCTGGTTGCTCTTT 
^J^I9II^I TCTGGCCCTGCTC TCTTGCTTGACTGTGCCCGCTTCGGCCTACCAAGT-900 
GCGCAACTCCACGGGGCTTTACCACGTCACCAATGATTGCCCTAACTCGAGTATTGTGTA 
CGAGGCGGCCGATGCCATCCTGCACACTCCGGGGTGCGTCCCTTGCGTTCGTGAGGGCAA 
CGCCTCGAGGTGTTGGGTGGCGATGACCCCTACGGTGGCCACCAGGGATGGCAAACTCCC 
CGC.GACGCAGCTTCGACGTCACATCGATCTGCTTGTCGGGAGCGCCACCCTCTGTTCGGC 
CCTCTACGTGGGGGACCTATGCGGGTCTGTCTTTCTTGTCGGCCAACTGTTCACCTTCTC-1200 
TCCCAGGCGCCACTGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATATAAC 
GGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTGGTAAT 
GGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTGGGG 
AGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGTAGT 
GCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACCGGGGGAAGTGCCGGCCA-1500 
CACTGTGTCTGGATTTGTTAGCCTCCTCGCACCAGGCGCCAAGCAGAACGTCCAGCTGAT 
CAACACCAACGGCAGTTGGCACCTCAATAGCACGGCCCTGAACTGCAATGATAGCCTCAA 
CACCGGCTGGTTGGCAGGGCTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCCTGA 
GAGGCTAGCCAGCTGCCGACCCCTTACCGATTTTGACCAGGGCTGGGGCCCTATCAGTTA 
TGCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACCCCCCAAAACCTTG-1800 
CGGTATTGTGCCCGCGAAGAGTGTGTGTGGTCCGGTATATTGCTTCACTCCCAGCCCCGT 
GGTGGTGGGAACGACCGACAGGTCGGGCGCGCCCACCTACAGCTGGGGTGAAAATGATAC 
GGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGTTCGGTTGTACCTG 
GATGAACTCAACTGGATTCACCAAAGTGTGCGGAGCGCCTCCTTGTGTCATCGGAGGGGC 
GGGCAACAACACCCTGCACTGCCCCACTGATTGCTTCCGCAAGCATCCGGACGCCACATA-2100 
CTCTCGGTGCGGCTCCGGTCCCTGGATCACACCCAGGTGCCTGGTCGACTACCCGTATAG 
GCTTTGGCATTATCCTTGTACCATCAACTACACCATATTTAAAATCAGGATGTACGTGGG 
AGGGGTCGAACACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCGAACGTTGCGATCT 
GGAAGACAGGGACAGGTCCGAGCTCAGCCCGTTACTGCTGACCACTACACAGTGGCAGGT 
CCTCCCGTGTTCCTTCACAACCCTACCAGCCTTGTCCACCGGCCTCATCCACCTCCACCA-2400 
GAACATTGTGGACGTGCAGTACTTGTACGGGGTGGGGTCAAGCATCGCGTCCTGGGCCAT 
TAACTGGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTG 
CTTGTGGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGAACCTCGTAATACT 
TAATGCAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTT 
TGCATGGTATTTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCTTCTACGGGATGTG-2700 
GCCTCTCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGT 
GGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATA 
TTACAAGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGA 
AGCGCAACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCAT 
CTTACTCATGTGTGCTGTACACCCGACTCTGGTATTTGACATCACCAAATTGCTGCTGGC-3000 
CGTCTTCGGACCCCTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCG 
CGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGT 
GCAAATGGTCATCATTAAGTTAGGGGCGCTTACTGGCACCTATGTTTATAACCATCTCAC 
TCCTCTTCGGGACTGGGCGCACAACGGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGT 
CGTCTTCTCCCAAATGGAGACCAAGCTCATCACGTGGGGGGCAGATACCGCCGCGTGCGG-3300 
TGACATCATCAACGGCTTGCCTGTTTCCGCCCGCAGGGGCCGGGAGATACTGCTCGGGCC 
AGCCGATGGAATGGTCTCCAAGGGGTGGAGGTTGCTGGCGCCCATCACGGCGTACGCCCA 
GCAGACAAGGGGCCTCCTAGGGTGCATAATCACCAGCCTAACTGGCCGGGACAAAAACCA 
AGTGGAGGGTGAGGTCCAGATTGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCAT 



FIG. 62B 

CAATG6GGTGT6CT6GACTGTCTACCACGGG6CCGGAAC6AGGACCATCGCGTCACCCAA-3600 
GGGTCCTGTCATCCAGATGTATACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCC 
GCAAGGTAGCCGCTCATTGACACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCAC 
GAGGCACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGGGCAGCCTGCTGTC 

GCCCCGGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGG 

GCACGCCGTGGGCATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGGCGGTGGA-3900 

CTTTATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGTTCACGGATAACTC 

CTCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATGCTCCCACAGGCAG 

CGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTGCTAGTACT 

CAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCAAGGCTCATGGGAT 

CGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTC-A200 

CACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGACATAATAAT 

TTGTGACGAGTGCCACTCCAGGGATGCCACATCCATCTTGGGCATCGGCACTGTCCTTGA 

CCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCCTCCGGGCTC 

CGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAGATCCC 

TTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTCTG-^1500 

TCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGGCATCAATGC 

CGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGATGTTGTCGT 

CGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGTGATAGACTG 

CAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCTTCACCATTGAGAC 

AATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGGACTGGCAGGGG-^800 

GAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGCATGTTCGACTC 

GTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTCACGCCCGCCGA 

GACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTGTGCCAGGACCA 

TCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCCCACTTTCTATC 

CCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTG-5100 

CGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCGCCT 

CAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTTCAGAATGA 

AATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCGGCCGACCTGGA 

GGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTG 

CCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAAT-5400 

CATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCA 

CTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGG 

CCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTG 

GCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATA 

CTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTAC-5700 

AGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGG 

GTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTT 

AGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGG 

GTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCC _ 

CTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCCTCGTAGT-6000 

CGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGGCAGTGCA 

GTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTA 

CGTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTGTAAC 

CCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCACTCCATGCTCCGG 

TTCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCGACTTTAAGACCTG-6300 



FIG. 62C 

GCTAAAA6CTAAGCTCATGCCACA6CTGCCTG6GATCCCCTTTGTGTCCTGCCAGCGCGG 

6TATAAGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCGCTGCCACTGTGGAGCTGA 

GATCACTGGACATGTCAAAAACGGGACGATGAGGATCGTCGGTCCTAGGACCTGCAGGAA 

CATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCTGTACCCCCCTTCC 

TGCGCCGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAGAGGAATATGTGGAGATAAG-6600 

GCAGGTGGGGGACTTCCACTACGTGACGGGTATGACTACTGACAATCTCAAATGCCCGTG 

CCAGGTCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCCTACATAGGTTTGC 

GCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGGACTCCACGAATA 

CCCGGTAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCGTGTTGACGTCCAT 

GCTCACTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGTTGGCGAGGGGATC-6900 

ACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGCAAC 

TTGCACCGCTAACCATGACTCCCCTGATGCTGAGCTCATAGAGGCCAACCTCCTATGGAG 

GCAGGAGATGGGCGGCAACATCACCAGGGTTGAGTCAGAAAACAAAGTGGTGATTCTGGA 

CTCCTTCGATCCGCTTGTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAAAT 

CCTGCGGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGACTATAA-7200 

CCCCCCGCTAGTGGAGACGTGGAAAAAGCCCGACTACGAACCACCTGTGGTCCATGGCTG 

TCCGCTTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGCCTCGGAAGAAGCGGACGGTGGT 

CCTCACTGAATCAACCCTATCTACTGCCTTGGCCGAGCTCGCCACCAGAAGCTTTGGCAG 

CTCCTCAACTTCCGGCATTACGGGCGACAATACGACAACATCCTCTGAGCCCGCCCCTTC 

TGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCCCCCTGGAGGGGGA-7500 

GCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACGGTCAGTAGTGAGGCCAACGC 

GGAGGATGTCGTGTGCTGCTCAATGTCTTACTCTTGGACAGGCGCACTCGTCACCCCGTG 

CGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGTTGCTACGTCACCA 

CAATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAGAAGAAAGTCACATT 

TGACAGACTGCAAGTTCTGGACAGCCATTACCAGGACGTACTCAAGGAGGTTAAAGCAGC-7800 

GGCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGGAAGCTTGCAGCCTGACGCCCCC 

ACACTCAGCCAAATCCAAGTTTGGTTATGGGGCAAAAGACGTCCGTTGCGATGCCAGAAA 

GGCCGTAACCCACATCAACTCCGTGTGGAAAGACCTTCTGGAAGACAATGTAACACGAAT 

AGACACTACCATCATGGCTAAGAACGAGGTTTTCTGCGTTCAGCCTGAGAAGGGGGGTCG 

TAAGCCAGCTCGTCTCATCGTGTTCCCCGATCTGGGCGTGCGCGTGTGCGAAAAGATGGC-8100 

TTTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGATGGGAAGCTCCTACGGATTCCA 

ATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGTCCAAGAAAACCCC 

AATGGGGTTCTCGTATGATACCCGCTGCTTTGACTCCACAGTCACTGAGAGCGACATCCG 

TACGGAGGAGGCAATCTACCAATGTTGTGACCTCGACCCCCAAGCCCGCGTGGCCATCAA 

GTCCCTCACCGAGAGGCTTTATGTTGGGGGCCCTCTTACCAATTCAAGGGGGGAGAACTG-8400 

CGGCTATCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTGGTAACACCCTCAC 

TTGCTACATCAAGGCCCGGGCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCT 

CGTGTGTGGCGACGACTTAGTCGTTATCTGTGAAAGCGCGGGGGTCCAGGAGGACGCGGC 

GAGCCTGAGAGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCCCCCTGGGGACCCCCC 

ACAACCAGAATACGACTTGGAGCTCATAACATCATGCTCCTCCAACGTGTCAGTCGCCCA-8700 

CGACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAG 

AGCTGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTAGGCAACATAATCAT 

GTTTGCCCCCACACTGTGGGCGAGGATGATACTGATGACCCATTTCTTTAGCGTCCTTAT 

AGCCAGGGACCAGCTTGAACAGGCCCTCGATTGCGAGATCTACGGGGCCTGCTACTCCAT 

AGAACCACTTGATCTACCTCCAATCATTCAAAGACTCCATGGCCTCAGCGCATTTTCACT-9000 

CCACAGTTACTCTCCAGGTGAAATTAATAGGGTGGCCGCATGCCTCAGAAAACTTGGGGT 

ACCGCCCTTGCGAGCTTGGAGACACCGGGCCCGGAGCGTCCGCGCTAGGCTTCTGGCCAG 

AGGAGGCAGGGCTGCCATATGTGGCAAGTACCTCTTCAACTGGGCAGTAAGAACAAAGCT 

CAAAC 



FIG. 62D 

1 CACTCCACCATGAATCACTCCCCT6T6AGGAACTACT6TCTTCACGCAGAAAGC6TCTAG 
GTGAGGTGGTACTTAGTGAGGGGACACTCCTTGATGACAGAAGTGCGTCTTTCGCAGATC 

61 CCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGAGAGCCATA 
GGTACCGCAATCATACTCACAGCACGTCGGAGGTCCTGGGGGGGAGGGCCCTCTCGGTAT 

121 GTGGTCTGCGGAACCGGTGAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGA 
CACCAGACGCCTTGGCCACTCATGTGGCCTTAACGGTCCTGCTGGCCCAGGAAAGAACCT 

181 TCAACCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGACTGCTAGCCGAGTAGT 
AGTTGGGCGAGTTACGGACCTCTAAACCCGCACGGGGGCGTTCTGACGATCGGCTCATCA 

241 GTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAG 
CAACCCAGCGCTTTCCGGAACACCATGACGGACTATCCCACGAACGCTCACGGGGCCCTC 

301 GTCTCGTAGACCGTGCACCATGAGCACGAATCCTAAACCTCAAAAAAAAAACAAACGTAA 
CAGAGCATCTGGCACGTGGTACTCGTGCTTAGGATTTGGAGTTTTTTTTTTGTTTGCATT 

361 CACCAACCGTCGCCCACAGGACGTCAAGTTCCCGGGTGGCGGTCAGATCGTTGGTGGAGT 
GTGGTTGGCAGCGGGTGTCCTGCAGTTCAAGGGCCCACCGCCAGTCTAGCAACCACCTCA 

121 TTACTTGTTGCCGCGCAGGGGCCCTAGATTGGGTGTGCGCGCGACGAGAAAGACTTCCGA 
AATGAACAACGGCGCGTCCCCGGGATCTAACCCACACGCGCGCTGCTCTTTCTGAAGGCT 

481 GCGGTCGCAACCTCGAGGTAGACGTCAGCCTATCCCCAAGGCTCGTCGGCCCGAGGGCAG 
CGCCAGCGTTGGAGCTCCATCTGCAGTCGGATAGGGGTTCCGAGCAGCCGGGCTCCCGTC 

541 GACCTGGGCTCAGCCCGGGTACCCTTGGCCCCTCTATGGCAATGAGGGCTGCGGGTGGGC 
CTGGACCCGAGTCGGGCCCATGGGAACCGGGGAGATACCGTTACTCCCGACGCCCACCCG 

601 GGGATGGCTCCTGTCTCCCCGTGGCTCTCGGCCTAGCTGGGGCCCCACAGACCCCCGGCG 
CCCTACCGAGGACAGAGGGGCACCGAGAGCCGGATCGACCCCGGGGTGTCTGGGGGCCGC 

661 TAGGTCGCGCAATTTGGGTAAGGTCATCGATACCCTTACGTGCGGCTTCGCCGACCTCAT 
ATCCAGCGCGTTAAACCCATTCCAGTAGCTATGGGAATGCACGCCGAAGCGGCTGGAGTA 

721 GGGGTACATACCGCTCGTCGGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGGCGCATGG 
CCCCATGTATGGCGAGCAGCCGCGGGGAGAACCTCCGCGACGGTCCCGGGACCGCGTACC 

781 CGTCCGGGTTCTGGAAGACGGCGTGAACTATGCAACAGGGAACCTTCCTGGTTGCTCTTT 
GCAGGCCCAAGACCTTCTGCCGCACTTGATACGTTGTCCCTTGGAAGGACCAACGAGAAA 

841 CTCTATCTTCGTTCTGGCCCTGCTCTCTTGCTTGACTGTGCCCGCTTCGGCCTACCAAGT 
GAGATAGAAGGAAGACCGGGACGAGAGAACGAACTGACACGGGCGAAGCCGGATGGTTCA 

901 GCGCAACTCCACGGGGCTTTACCACGTCACCAATGATTGCCCTAACTCGAGTATTGTGTA 
CGCGTTGAGGTGCCCCGAAATGGTGCAGTGGTTACTAACGGGATTGAGCTCATAACACAT 

961 CGAGGCGGCCGATGCCATCCTGCACACTCCGGGGTGCGTCCCTTGCGTTCGTGAGGGCAA 
GCTCCGCCGGCTACGGTAGGACGTGTGAGGCCCCACGCAGGGAACGCAAGCACTCCCGTT 

1021 CGCCTCGAGGTGTTGGGTGGCGATGACCCCTACGGTGGCCACCAGGGATGGCAAACTCCC 
GCGGAGCTCCACAACCCACCGCTACTGGGGATGCCACCGGTGGTCCCTACCGTTTGAGGG 

1081 CGCGACGCAGCTTCGACGTCACATCGATCTGCTTGTCGGGAGCGCCACCCTCTGTTCGGC 
GCGCTGCGTCGAAGCTGCAGTGTAGCTAGACGAACAGCCCTCGCGGTGGGAGACAAGCCG 

1141 CCTCTACGTGGGGGACCTATGCGGGTCTGTCTTTCTTGTCGGCCAACTGTTCACCTTCTC 
GGAGATGCACCCCCTGGATACGCCCAGACAGAAAGAACAGCCGGTTGACAAGTGGAAGAG 

1201 TCCCAGGCGCCACTGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATATAAC 
AGGGTCCGCGGTGACCTGCTGCGTTCCAACGTTAACGAGATAGATAGGGCCGGTATATTG 



FIG. 62E 

1321 GGCTCAGCTGCTCCGGATCCCACAAGCCATCTT6GACATGATCGCT6GTGCTCACTGGGG 
CCGAGTCGACGAGGCCTAGGGTGTTCGGTAGAACCTGTACTAGCGACCACGAGTGACCCC 

1381 AGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGG 

TCAGGACCGCCCGTATCGCATAAAGAGGTACCACCCCTTGACCCGCTTCCAGGACCATCA 

1111 GCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACCGGGGGAAGTGCCGGCCA 
CGACGACGATAAACGGCCGCAGCTGCGCCTTTGGGTGCAGTGGCCCCCTTCACGGCCGGT 

1501 CACTGTGTCTGGATTTGTTAGCCTCCTCGCACCAGGCGCCAAGCAGAACGTCCAGCTGAT 
GTGACACAGACCTAAACAATCGGAGGAGCGTGGTCCGCGGTTCGTCTTGCAGGTCGACTA 

1561 CAACACCAACGGCAGTTGGCACCTCAATAGCACGGCCCTGAACTGCAATGATAGCCTCAA 
GTTGTGGTTGCCGTCAACCGTGGAGTTATCGTGCCGGGACTTGACGTTACTATCGGAGTT 

1621 CACCGGCTGGTTGGCAGGGCTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCCTG 
GTGGGCGACCAACCGTCCCGAAAAGATAGTGGTGTTCAAGTTGAGAAGTCCGACAGGACT 

1681 GAGGCTAGCCAGCTGCCGACCCCTTACCGATTTTGACCAGGGCTGGGGCCCTATCAGTTA 
CTCCGATCGGTCGACGGCTGGGGAATGGCTAAAACTGGTCCCGACCCCGGGATAGTCAAT 

1711 JGCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACCCCCCAAAAC 

ACGGTTGCCTTCGCCGGGGCTGGTCGCGGGGATGACGACCGTGATGGGGGGTTTTGGAAC 

1801 CGGTATTGTGCCCGCGAAGAGTGTGTGTGGTCCGGTATATTGCTTCACTCCCAGCCCCGT 
GCCATAACACGGGCGCTTCTCACACACACCAGGCCATATAACGAAGTGAGGGTCGGGGCA 

1861 GGTGGTGGGAACGACCGACAGGTCGGGCGCGCCCACCTACA 

CCACCACCCTTGCTGGCTGTCCAGCCCGCGCGGGTGGATGTCGACCCCACTTTTACTATG 

1921 GGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGC 

CCTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCGACCCGTTAACCAAGCCAACATGGAC 

1981 GATGAACTCAACTGGATTCACCAAAGTGTGCGGAGCGCCTCCTTGTGTCATCGGAGGGGC 
CTACTTGAGTTGACCTAAGTGGTTTCACACGCCTCGCGGAGGAACACAGTAGCCTCCCCG 

2011 GGGCAACAACACCCTGCACTGCCCCACTGATTGCTTCCGCAAGCATCCGGACGCCACATA 
CCCGTTGTTGTGGGACGTGACGGGGTGACTAACGAAGGCGTTCGTAGGCCTGCGGTGTAT 

2101 CTCTCGGTGCGGCTCCGGTCCCTGGATCACACCCA 

GAGAGCCACGCCGAGGCCAGGGACCTAGTGTGGGTCCACGGACCAGCTGATGGGCATATC 

2161 rrIIT^rTllTT^TT^TTp^^^^^^^^^^^^^ AT ^AAAATCAGGATGTACGTGGG 
CGAAACCGTAATAGGAACATGGTAGTTGATGTGGTATAAATTTTAGTCCTACATGCACCC 

2221 AGGGGTCGAACACAGGCTGGAAGCTGCCTGCAAC 

TCCCCAGCTTGTGTCCGACCTTCGACGGACGTTGACCTGCGCCCCGCTTGCAACGCTAGA 

2281 GGAAGACAGGGACAGGTCCGAGCTCAGCCCGTTA 

CCTTCTGTCCCTGTCCAGGCTCGAGTCGGGCAATGACGACTGGTGATGTGTCACCGTCCA 

2311 CCTCCCGTGTTCCTTCACAACCCTACCAGCCTTGTCCACCGGCCTCATCCACCTCCACCA 
GGAGGGCACAAGGAAGTGTTGGGATGGTCGGAACAGGTGGCCGGAGTAGGTGGAGGTGGT 

2101 GAACATTGTGGACGTGCAGTACTTGTACGG 

CTTGTAACACCTGCACGTCATGAACATGCCCCACCCCAGTTCGTAGCGCAGGACCCGGTA 

2161 TAAGTGGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTG 
ATTCACCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGAC 

2521 CTTGTGGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGAACCTCGTAATACT 
GAACACCTACTACGATGAGTATAGGGTTCGCCTCCGCCGAAACCTCTTGGAGCATTATGA 

2581 TAATGCAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTT 



FIG. 62F 

2641 TGCATG6TATTTGAAGGGTAAGTG66T6CCCGGAGCG6TCTACACCTTCTACGG6AtGTG 
ACGTACCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGAAGATGCCCTACAC 

2701 GCCTCTCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGT 
CGGAGAGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCGACCTGTGCCTCCA 

2761 GGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATA 
CCGGCGCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTAT 

2821 TTACAAGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGA 
AATGTTCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAGACTGGTCTCACCT 

2881 AGCGCAACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCAT 
TCGCGTTGACGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCGCGCTGCGGCAGTA 

2941 CTTACTCATGTGTGCTGTACACCCGACTCTGGTATTTGACATCACCAAATTGCTGCTGGC 
GAATGAGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGTTTAACGACGACCG 

3001 CGTCTTCGGACCCCTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCG 
GCAGAAGCCTGGGGAAACCTAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGC 

3061 CGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGT 
GCAGGTTCCGGAAGAGGCCAAGACGCGCAATCGCGCCTTCTACTAGCCTCCGGTAATGCA 

3121 GCAAATGGTCATCATTAAGTTAGGGGCGCTTACTGGCACCTATGTTTATAACCATCTCAC 
CGTTTACCAGTAGTAATTCAATCCCCGCGAATGACCGTGGATACAAATATTGGTAGAGTG 

3181 TCCTCTTCGGGACTGGGCGCACAACGGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGT 
AGGAGAAGCCCTGACCCGCGTGTTGCCGAACGCTCTAGACCGGCACCGACATCTCGGTCA 

3241 CGTCTTCTCCCAAATGGAGACCAAGCTCATCACGTGGGGGGCAGATACCGCCGCGTGCGG 
GCAGAAGAGGGTTTACCTCTGGTTCGAGTAGTGCACCCCCCGTCTATGGCGGCGCACGCC 

3301 TGACATCATCAACGGCTTGCCTGTTTCCGCCCGCAGGGGCCGGGAGATACTGCTCGGGCC 
ACTGTAGTAGTTGCCGAAGGGACAAAGGCGGGCGTCCCCGGCCCTCTATGACGAGCCCGG 

3361 AGCCGATGGAATGGTCTCCAAGGGGTGGAGGTTGCTGGCGCCCATCAGGGCGTACGCCCA 
TCGGCTACCTTACCAGAGGTTCCCCACCTCCAACGACCGCGGGTAGTGCCGCATGCGGGT 

3421 GCAGACAAGGGGCCTCCTAGGGTGCATAATCACCAGCCTAACTGGCCGGGACAAAAACCA 
CGTCTGTTCCCCGGAGGATCCCACGTATTAGTGGTCGGATTGACCGGCCCTGTTTTTGGT 

3481 AGTGGAGGGTGAGGTCCAGATTGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCAT 
TCACCTCCCACTCCAGGTCTAACACAGTTGACGACGGGTTTGGAAGGACCGTTGCACGTA 

3541 CAATGGGGTGTGCTGGACTGTCTACCACGGGGCCGGAACGAGGACCATCGCGTCACCCAA 
GTTACCCCACACGACCTGACAGATGGTGCCCCGGCCTTGCTCCTGGTAGCGCAGTGGGTT 

3601 GGGTCCTGTCATCCAGATGTATACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCC 
CCCAGGACAGTAGGTCTACATATGGTTACATCTGGTTCTGGAACACCCGACCGGGCGAGG 

3661 GCAAGGTAGCCGCTCATTGACACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCAC 
GCTTCCATCGGCGAGTAACTGTGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTG 

3721 GAGGCACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGGGCAGCCTGCTGTC 
CTCCGTGCGGCTACAGTAAGGGCACGCGGCCGCCCCACTATCGTCCCCGTCGGACGACAG 

3781 GCCCCGGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGG 
CGGGGCCGGGTAAAGGATGAACTTTCCGAGGAGCCCCCCAGGCGACAACACGGGGCGCCC 

3841 GCACGCCGTGGGCATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGGCGGTGGA 
CGTGCGGCACCCGTATAAATCCCGGCGCCACACGTGGGCACCTCACCGATTCCGCCACCT 

3901 CTTTATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGTTCACGGATAACTC 



FIG. 62G 

3961 CTCTCCACCAGTA6TGCCCCAGAGCTTCCAG6TGGCTCACCTCCATGCTCCCACAGGCA6 . 
GAGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGGGTGTCCGTC - 

4021 CGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTGCTAGTACT 
GCCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATATTCCACGATCATGA 

4081 CAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCAAGGCTCATGGGAT 
GTTGGGGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGTTCCGAGTACCCTA 

4141 CGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTC 
GCTAGGATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGGGGTAGTGCATGAG 

4201 CACCTACGGCAAGTTGCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGACATAATAAT 
GTGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAATACTGTATTATTA 

4261 TTGTGACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATCGGCACTGTCCTTGA 
AACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAGCCGTGACAGGAACT 

4321 CCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACGCCTCCGGGCTC 
GGTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGGAGGCCCGAG 

4381 CGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAGATCCC 
GCAGTGACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCCTCTCTAGGG 

4441 TTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTCTG 
AAAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGAGTAGAAGAC 

4501 TCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGGCATCAATGC 
AGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTAACCCGTAGTTACG 

4561 CGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGATGTTGTCGT 
GCACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCTACAACAGCA 

4621 CGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGTGATAGACTG 
GCACCGTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCACTATCTGAC 

4681 CAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCTTCACCATTGAGAC 
GTTATGCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTG 

4741 AATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGGACTGGCAGGGG 
TTAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCCTGACCGTCCCC 

4801 GAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGCATGTTCGACTC 
CTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCGTACAAGCTGAG 

4861 GTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTCACGCCCGCCGA 
CAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAGTGCGGGCGGCT 

4921 GACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTGTGCCAGGACCA 
CTGATGTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGCACACGGTCCTGGT 

4981 TCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCCCACTTTCTATC 
AGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGGGTGAAAGATAG 

5041 CCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTG 
GGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTTCGGTGGCACAC 

5101 CGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCGCCT 
GCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACAAACTAAGCGGA 

5161 CAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTTCAGAATGA 
GTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGACAAGTCTTACT 

5221 AATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCGGCCGACCTGGA 



FIG. 62H 

5281 6GTC6TCAC6A6CACCT6G6T6CTC6TT66C66C6TCCTGGCT6CTTT6GCC6C6TATT6 
CCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAACCGGCGCATAAC 

5341 CCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAAT 
GGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTA 

5401 CATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCA 
GTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGT 

5461 CTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGG 
GAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCC 

5521 CCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTG 
GGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGAC 

5581 GCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATA 
CGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTAT 

5641 CTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTAC 
GAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATG 

5701 AGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGG 
TCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATAACCCCCC 

5761 GTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTT 
CACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAA 

5821 AGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGG 
TCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGGAACGTCC 

5881 GTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCC 
CATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCACTCCAGGG 

5941 CTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCCTCGTAGT 
GAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGGAGCATCA 

6001 CGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGGCAGTGCA 
GCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCCGTCACGT 

6061 GTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTA 
CACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGAT 

6121 CGTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTGTAAC 
GCACGGCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGTCGGAGTGACATTG 

6181 CCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCACTCCATGCTCCGG 
GGTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCC 

6241 TTCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCGACTTTAAGACCTG 
AAGGACCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGAC 

6301 GCTAAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCTTTGTGTCCTGCCAGCGCGG 
CGATTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACAGGACGGTCGCGCC 

6361 GTATAAGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCGCTGCCACTGTGGAGCTGA 
CATATTCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCGACGGTGACACCTCGACT 

6421 GATCACT6GACATGTCAAAAACGGGACGATGAGGATCGTCGGTCCTAGGACCTGCAGGAA 
CTAGTGACCTGTACAGTTTTTGCCCTGCTACTCCTAGCAGCCAGGATCCTGGACGTCCTT 

6481 CATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCTGTACCCCCCTTCC 
GTACACCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCCCGGGGACATG6GGGGAAGG 

6541 TGCGCCGAACTACACGTTr.GC.Gf/TATGGAGGGTGTf.TGr.AGAGGAATATGTGGAGATAAG 



FIG. 621 

6601 GCAG6TGGG6GACTTCCACTACGTGACG66TATGACTACTGACAATCTCAAATGCCCGT6 
CGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAGAGTTTACGGGCAC 

6661 CCAGGTCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCCTACATAGGTTTGC 
GGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGCGGATGTATCCAAACG 

6721 GCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGGACTCCACGAATA 
CGGGGGGACGTTCGGGAACGACGCCCTCCTCCATAGTAAGTCTCATCCTGAGGTGCTTAT 

6781 CCCGGTAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCGTGTTGACGTCCAT 
GGGCCATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCTGCACCGGCACAACTGCAGGTA 

6841 GCTCACTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGTTGGCGAGGGGATC 
CGAGTGACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCAACCGCTCCCCTAG 

6901 ACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGCAAC 
TGGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCGTTG 

6961 XI^^^PTtt^^I^^^^^^^^^^^^^^TCATAGAGGCCAACCTCCTATGGAG 
AACGTGGCGATTGGTACTGAGGGGACTACGACTCGAGTATCTCCGGTTGGAGGATACCTC 

7021 GCAGGAGATGGGCGGCAACATCACCAGGGTTGAGTCAGAAAACAAAGTGGTGATTCTGGA 
CGTCCTCTACCCGCCGTTGTAGTGGTCCCAACTCAGTCTTTTGTTTCACCACTAAGACCT 

7081 CTCCTTCGATCCGCTTGTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAAAT 
GAGGAAGCTAGGCGAACACCGCCTCCTCCTGCTCGCCCTCTAGAGGCATGGGCGTCTTTA 

7141 CCTGCGGAAGTCTCGGAGATT^ 

GGACGCCTTCAGAGCCTCTAAGCGGGTCCGGGACGGGCAAACCCGCGCCGGCCTGATATT 

7201 CCCCCCGCTAGTGGAGACGTGGAAAAAGCCCGACTACGAACCACCTGTGGTCCATGGCTG 
GGGGGGCGATCACCTCTGCACCTTTTTCGGGCTGATGCTTGGTGGACACCAGGTACCGAC 

7261 TCCGCTTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGCCTCGGAAGAAGCGGACGGTGGT 
AGGCGAAGGTGGAGGTTTCAGGGGAGGACACGGAGGCGGAGCCTTCTTCGCCTGCCACCA 

7321 CCTCACTGAATCAACCCTAT 

GGAGTGACTTAGTTGGGATAGATGACGGAACCGGCTCGAGCGGTGGTCTTCGAAACCGTC 

7381 CTCCTCAACTTCCGGCATTACGGGCGACAATACGACAACATCCTCTGAGCCCGCCCCTTC 
GAGGAGTTGAAGGCCGTAATGCCCGCTGTTATGCTGTTGTAGGAGACTCGGGCGGGGAAG 

7441 TGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCCCCCTGGAGGGGGA 
ACCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGGGGGACCTCCCCCT 

7501 GCCTGGGGATCCGGATCTTAGCGACGGGTCAT^ 

CGGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGCCAGTCATCACTCCGGTTGCG 

7561 GGAGGATGTCGTGTGCTGCTCAATGTCTTACTCTTGGACAGGCGCACTCGTCACCCCGTG 
CCTCCTACAGCACACGACGAGTTACAGAATGAGAACCTGTCCGCGTGAGCAGTGGGGCAC 

7621 CGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGTTGCTACGTCACCA 
GCGGCGCCTTCTTGTCTTTGACGGGTAGTTACGTGATTCGTTGAGCAACGATGCAGTGGT 

7681 CAATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAGAAGAAAGTCACATT 
GTTAAACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCTTCTTTCAGTGTAA 

7741 TGACAGACTGCAAGTTCTGGACAG^ 

ACTGTCTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATGAGTTCCTCCAATTTCGTCG 

7801 GGCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGGAAGCTTGCAGCCTGACGCCCCC 
CCGCAGTTTTCACTTCCGATTGAACGATAGGCATCTCCTTCGAACGTCGGACTGCGGGGG 

78fii Ar.Ar.Tr.flfir.r.AAflTr.r.AAfiTTTfifiTTATfifififirAAAAfiAr.rnTrr.fiTTRrrATfirr.AfiAAA 



FIG. 62J 

7921 GGCCGTAACCCACATCAACTCCGTGTG6AAAGACCTTCTGGAAGACAATGTAACACCAAT 
CCGGCATTGGGTGTAGTTGAGGCACACCTTTCTGGAAGACCTTCTGTTACATTGTGGTTA 

7981 AGACACTACCATCATGGCTAAGAACGAGGTTTTCTGCGTTCAGCCTGAGAAGGGGGGTCG 
TCTGTGATGGTAGTACCGATTCTTGCTCCAAAAGACGCAAGTCGGACTCTTCCCCCCAGt 

8041 TAAGCCAGCTCGTCTCATCGTGTTCCCCGATCTGGGCGTGCGCGTGTGCGAAAAGATGGC 
ATTCGGTCGAGCAGAGTAGCACAAGGGGCTAGACCCGCACGCGCACACGCTTTTCTACCG 

8101 TTTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGATGGGAAGCTCCTACGGATTCCA 
AAACATGCTGCACCAATGTTTCGAGGGGAACCGGCACTACCCTTCGAGGATGCCTAAGGT 

8161 ATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGTCCAAGAAAACCCC 
TATGAGTGGTCCTGTCGCCCAACTTAAGGAGCACGTTCGCACCTTCAGGTTCTTTTGGGG 

8221 AATGGGGTTCTCGTATGATACCCGCTGCTTTGACTCCACAGTCACTGAGAGCGACATCCG 
TTACCCCAAGAGCATACTATGGGCGACGAAACTGAGGTGTCAGTGACTCTCGCTGTAGGC 

8281 TACG'GAGGAGGCAATCTACCAATGTTGTGACCTCGACCCCCAAGCCCGCGTGGCCATCAA 
ATGCCTCCTCCGTTAGATGGTTACAACACTGGAGCTGGGGGTTCGGGCGCACCGGTAGTT 

8311 GTCCCTCACCGAGAGGCTTTATGTTGGGGGCCCTCTTAGCAATTCAAGGGGGGAGAACTG 
CAGGGAGTGGCTCTCCGAAATACAACCCCCGGGAGAATGGTTAAGTTCCCCCCTCTTGAC 

8401 CGGCTATCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTGGTAACACCCTCAC 
GCCGATAGCGTCCACGGCGCGCTCGCCGCATGACTGTTGATCGACACCATTGTGGGAGTG 

8461 TTGCTACATCAAGGCCCGGGCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCT 
AACGATGTAGTTCCGGGCCCGTCGGACAGCTCGGCGTCCCGAGGTCCTGACGTGGTACGA 

8521 CGTGTGTGGCGACGACTTAGTCGTTATCTGTGAAAGCGCGGGGGTCCAGGAGGACGCGGC 
GCACACACCGCTGCTGAATCAGCAATAGACACTTTCGCGCCCCCAGGTCCTCCTGCGCCG 

8581 GAGCCTGAGAGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCCCCCTGGGGACCCCCC 
CTCGGACTCTCGGAAGTGCCTCCGATACTGGTCCATGAGGCGGGGGGGACCCCTGGGGGG 

8641 ACAACCAGAATACGACTTGGAGCTCATAACATCATGCTCCTCCAACGTGTCAGTCGCCCA 
TGTTGGTCTTATGCTGAACCTCGAGTATTGTAGTACGAGGAGGTTGCACAGTCAGCGGGT 

8701 CGACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAG 
GCTGCCGCGACCTTTCTCCCAGATGATGGAGTGGGCACTGGGATGTTGGGGGGAGCGCTC 

8761 AGCTGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTAGGCAACATAATCAT 
TCGACGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGACCGATCCGTTGTATTAGTA 

8821 GTTTGCCCCCACACTGTGGGCGAGGATGATACTGATGACCCATTTCTTTAGCGTCCTTAT 
CAAACGGGGGTGTGACACCCGCTCCTACTATGACTACTGGGTAAAGAAATCGCAGGAATA 

8881 AGCCAGGGACCAGCTTGAACAGGCCCTCGATTGCGAGATCTACGGGGCCTGCTACTCCAT 
TCGGTCCCTGGTCGAACTTGTCCGGGAGCTAACGCTCTAGATGCCCCGGACGATGAGGTA 

8941 AGAACCACTTGATCTACCTCCAATCATTCAAAGACTCCATGGCCTCAGCGCATTTTCACT 
TCTTGGTGAACTAGATGGAGGTTAGTAAGTTTCTGAGGTACCGGAGTCGCGTAAAAGTGA 

9001 CCACAGTTACTCTCCAGGTGAAATTAATAGGGTGGCCGCATGCCTCAGAAAACTTGGGGT 
GGTGTCAATGAGAGGTCCACTTTAATTATCCCACCGGCGTACGGAGTCTTTTGAACCCCA 

9061 ACCGCCCTTGCGAGCTTGGAGACACCGGGCCCGGAGCGTCCGCGCTAGGCTTCTGGCCAG 
TGGCGGGAACGCTCGAACCTCTGTGGCCCGGGCCTCGCAGGCGCGATCCGAAGACCGGTC 

9121 AGGAGGCAGGGCTGCCATATGTGGCAAGTACCTCTTCAACTGGGCAGTAAGAACAAAGCT 
TCCTCCGTCCCGACGGTATACACCGTTCATGGAGAAGTTGACCCGTCATTCTTGTTTCGA 

9181 CAAAC 
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TRANSFORM E coli WITH RECOMBINANT PLASMIDS 
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NT. = EXPRESSION NOT TESTED 
t THIS POLYPEPTDE WAS NEGATIVE IN THIS 
COLONY SCREEN BUT POSITIVE BY WESTERN 
BLOT ANALYSIS 



FIG. 65 



FIG. 66 A 



R T 

MSTNPKPQKKNKRNTNRRPQDVKFP666QIV6GVYLLPRRGPRLGVRATR 

KTSERSQPRGRRQPIPKARRPEGRTWAQPGYPWPLYGNEGCGWAGWLLSP-100 

RGSRPSWGPTDPRRRSRNLGKVIDTLTCGFADLMGYIPLVGAPLGGAARA 

T 

LAHGVRVLEDGVNYATGNLPGCSFS I FLLALLSCLTVPASA YQVRNSTGL-200 
YH VTNDCPNSS I VYEAADA I LHTPGCVPCVREGNASRCWVAMTPTVATRD 
GKLPATQLRRHIDLLVGSATLCSALYVGDLCGSVFLVGQLFTFSPRRHWT-300 

V 

TQGCNCS I YPGHI TGH RMAWDMMMNWSPTTA L VM AQLLR I PQA I LDM I AG 

AHWGVLAGIAYFSIWGNWAKVLVVLLLFAGVDAETHVTGGSAGHTVSGFV-400 

SLLAPGAKQNVQL I NTNGSWHLNSTALNCNDSLNTGWLAGLFYHHKFNSS 

GCPERLASCRPLTDFDQGWGPISYANGSGPDQRPYCWHYPPKPCGIVPAK-500 

SVCGPVYCFTPSPVVVGTTDRSGAPTYSWGENDTDVFVLNNTRPPLGNWF 

GCTWMNSTGFTKVCGAPPCVIGGAGNNTLHCPTDCFRKHPDATYSRCGSG-600 

I 

PWLTPRCLVDYPYRLWHYPCTI NYTI FKI RMYVGGVEHRLEAACNWTRGE 
RCDLEDRDRSELSPLLLTTTQWQVLPCSFTTLPALSTGLIHLHQNIVDVQ-700 
YLYGVGSS I ASWA I KWEYVVLLFLLLADARVCSCLWMMLL I SQAEAALEN 
LVILNAASLAGTHGLVSFLVFFCFAWYLKGKWVPGAVYTFYGMWPLLLLL-800 

(N) 

LALPQRAYALDTEVAASCGGVVLVGLMALTLSPYYKRYISWCLWWLQYFL 
TRVEAQLHVWIPPLNVRGGRDAVI LLMCAVHPTLVFDITKLLLAVFGPLN-900 
ILQASLLKVPYFVRVQGLLRFCALARKMIGGHYVQIWI IKLGALTGTYVY 
NHLTPLRDWAHNGLRDLAVAVEFVVFSQMETKLITWGADTAACGDI INGL-1000 
PVSARRGREILLGPADGMVSKGWRLLAPITAYAQQTRGLLGCI ITSLTGR 
DKNQVEGEVQ I VSTAAQTTFATC I NGVCWTVYHGAGTRTI ASPKGPV I QM- 1 100 
YTNVDQDLVGWPAPQGSRSLTPCTCGSSDLYLVTRHADVIPVRRRGDSRG 
SLLSPRPISYLKGSSGGPLLCPAGHAVGIFRAAVCTRGVAKAVDFIPVEN-1200 
LETTMRSPVFTDNSSPPVVPQSFQVAHLHAPTGSGKSTKVPAAYAAQGYK 

L 

VLVLNPSVAATLGFGAYMSKAHG I DPN I RTGVRT I TTGSP I TYSTYGKFL-1300 

ADGGCSGGAYDI I ICDECHSTDATSILGIGTVLDQAETAGARLVVLATAT 
PPGSVTVPHPN I EEVALSTTGE I PFYGKA I PLEV I KGGRHL I FCHSKKKC-1400 
DELAAKLVALGINAVAYYRGLDVSVIPTSGDVVVVATDALMTGYTGDFDS 

Y (S) 
VI DCNTCVTQTVDFSLDPTFT I ET I TLPQDAVSRTQRRGRTGRGKPG I YR- 1 500 
FVAPGERPSGMFDSSVLCECYDAGCAWYELTPAETTVRLRAYMNTPGLPV 
CQDHLEFWEGVFTGLTHIDAHFLSQTKQSGENLPYLVAYQATVCARAQAP-1600 
PPSWDQMWKCL I RLKPTLHGPTPLL YRLGAVQNE I TLTHPVTKY I MTCMS 
ADLEVVTSTWVLVGGVL AAL AA YCLSTGCVV I VGRVVLSGKPA 1 1 PDREV- 1 700 
LYREFDEMEECSQHLPYIEQGMMLAEQFKQKALGLLQTASRQAEVIAPAV 
QTNWQKL ETFWAKHMWNF I SG I QYL AGLSTLPGN P A I ASLMA FTAAVTSP-1 800 
LTTSQTLLFN I LGGWVA AQLAAPGAATAFVGAGL AGAA I GSVGLGKVL I D 



FIG. 66B 

(G) 

ILAGYGAGVAGALVAFKIMSGEVPSTEDLVNLLPAILSPGALVVGVVCAA-1900 

( HC ) 

ILRRHVGPGEGAVQWMNRLIAFASRGNHVSPTHYVPESDAAARVTAILSS 
LTVTQLLRRLHQWISSECTTPCSGSWLRDIWDWICEVLSDFKTWLKAKLM-2000 

(V) 

PQLPGIPFVSCQRGYKGVWRGDGIMHTRCHCGAEITGHVKNGTMRIVGPR 

TCRNMWSGTFPINAYTTGPCTPLPAPNYTFALWRVSAEEYVEIRQVGDFH-2100 

YVTGMTTDNLKCPCQVPSPEFFTELDGVRLHRFAPPCKPLLREEVSFRVG 

LHEYPVGSQLPCEPEPDVAVLTSMLTDPSHITAEAAGRRLARGSPPSVAS-2200 

SSASQLSAPSLKATCTANHDSPDAELIEANLLWRQEMGGNITRVESENKV 

VILDSFDPLVAEEDEREISVPAEILRKSRRFAQALPVWARPDYNPPLVET-2300 

S 

WKKPDYEPPVVHGCPLPPPKSPPVPPPRKKRTVVLTESTLSTALAELATR 

(FA) 

SFGSSSTSGITGDNTTTSSEPAPSGCPPDSDAESYSSMPPLEGEPGDPDL-2400 

SDGSWSTVSSEANAEDVVCCSMSYSWTGALVTPCAAEEQKLPINALSNSL 

LRHHNLVYSTTSRSACQRQKKVTFDRLQVLDSHYQDVLKEVKAAASKVKA-2500 

(F) 

NLLSVEEACSLTPPHSAKSKFGYGAKDVRCHARKAVTHINSVWKDLLEDN 

VTPIDTTIMAKNEVFCVQPEKGGRKPARLIVFPDLGVRVCEKI v lALYDVVT-2600 

KLPLAVMGSSYGFQYSPGQRVEFLVQAWKSKKTPMGFSYDTRCFDSTVTE 

(G) 

SD I RTEEAI YQCCDLDPQARVA I KSLTERLYVGGPLTNSRGENCGYRRCR-2700 
ASGVLTTSCGNTLTCYIKARAACRAAGLQDCTMLVCGDDLVVICESAGVQ 
EDAASLRAFTEAMTRYSAPPGDPPQPEYDLELITSCSSNVSVAHDGAGKR-2800 
VYYLTRDPTTPLARAAWETARHTFVNSWLGNIIMFAPTLWARMILMTHFF 
SVLIARDQLEQALDCEIYGACYSIEPLDLPPI IQRLHGLSAFSLHSYSPG-2900 

G 

EI NRVAACLRKLGVPPLRAWRHRARSVRARLLARGGRAA I CGKYLFNWAV 
RTKLK — (Stop codon not yet reached) 



( ) = Heterogeneity due to possible 5' or 3' 
terminal cloning artefacts. 
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Flaviviruses 
(Yellow Fever, 
West Nile, Dengue) 



NS3 region 
TATPP6 SAAQRRGRIGRNP- 



HCV 



TATPPG SRTQRRGRTGRGK- 

I I 
#1348 #1483 



NS5 

Highly-conserved 
Polymerase 
region 

GDDCVV 



--GDDLVV 
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#2737 



FIG. 73 



5' 3' 
CCGGGCGAGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAAC 
CGCTCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTG 
3' 5' 

5' 3' 
CATGTTTCCCCCTAATGAG 
GTACAAAGGGGGATTACTCAGC 
3' 5' 
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Hindu 9400 
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GluPheGlyS TVallleProThrSerGlyAspValValValValAlaThrAspAlaLeu 
1 GAATTCGGGTCCGTCATCCCGACCAGCGGCGATGTTGTCGTCGTGGCAACCGATGCCCTC 
CTTAAGCCCAGGCAGTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAG 

AA A A AAA 

1 EC0R1, 7 NLA1V, 8 AVA2 SAU96 , 15 F0K1, 24 NSPB11, 26 FNU4H 
1, 52 SFAN1, 57 MNL1, 60 N1A111, 

MetThrGlyTyrThrGlyAspPheAspSerVallleAspC^sAsnThrCysValThrGln 
6 1 ATGACCGGCTATACCGGCGACTTCGACTCGGTGATAGACTGCAATACGTGTGTCACCCAG 
TACTGGCCGATATGGCCGCTGAAGCTGAGCCACTATCTGACGTTATGCACACAGTGGGTC 

A A AAA A AA 

65 HPA11, 74 HPA11, 83 TAQ1, 85 HINF1, 90 HPH, 106 AFL111 MA 
E2, 112 MAE3, 113 HPH, 

ThrValAspPheSerLeuAspProThrPheThrlleGluThrlleThrLeuProGlnAsp 
121 ACAGTCGATTTCAGCCTTGACCCTACCTTCACCATTGAGACAATCACGCTCCCCCAAGAT 
TGTCAGCTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTTCTA 

A 4 A A 

125 TAQ1, 149 HPH, 178 SFAN1, 

AlaValSerArgThrGlnArgArgGlyArgThrGlyArgGlyLysProGlylleTyrArg 
181 GCTGTCTCCCGCACTCAACGTCGGGGCAGGACTGGCAGGGGGAAGCCAGGCATCTACAGA 
CGACAGAGGGCGTGAGTTGCAGCCCCGTCCTGACCGTCCCCCTTCGGTCCGTAGATGTCT 

A A 

198 MAE2, 226 EC0R11 SCRF1, 230 SFAN1, 

PheValAlaProGlyGluArgProProAlaCysSerThrArgProSerSerValSerAla 
241 TTTGTGGCACCGGGGGAGCGCCCTCCGGCATGTTCGACTCGTCCGTCCTCTGTG AGTGCC 
AAACACCGTGGCCCCCTCGCGGGAGGCCGTACAAGCTGAGCAGGCAGGAGACACTCACGG 

•> A AA A A AA A A A A 

246 BAN1 NLA1V, 250 HPA11 NCI1 SCRF1, 257 HAE11, 258 HHA1, 2 
62 MNL1, 265 HPA11, 268 NSPC1, 269 NLA111, 274 TAQ1, 276 HIN 
Fl, 287 MNL1, 296 BSP1286, 

Arglle 
301 CGAATTC 
GCTTAAG 

302 EC0R1, 
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AlaValAspPhelleProValGluAsnLeuGluThrThrMetArgSerProValPheThr 
2 GCGGTGGACTTTATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGTTCACG 
CGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACAAGTGC 

29 MAE1, 40 NLA111, 43 MNL1, 45 AVA2 NLA1V SAU96, 49 NCIl SC 
RF1, 50 HPA11/ 

AspAsnSerSerProProValValProGlnSerPheGlnValAlaHisLeuHisAlaPro 
6 2 GATAACTCCTCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATGCTCCC 
CTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGGG 

A A A A AAA. 

69 MNLl/ 83 BSP1286, 92 ALUl, 97 EC0R11 SCRF1, 106 HPH, 109 
MNL1 , 113 NLA111, 

ThrGlySerGlyLysSerThrLysValProAlaAlaTyrAlaAlaGlnGlyTyrLysVal 
122 ACAGGCAGCGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTG 
TGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATATTCCAC 

AA A A AA A A AAA 

126 BBV FNU4H1/ 127 NSPB11/ 129 FNU4H1, 145 AVA2 NIA1V SAU96 
, 148 NCIl SCRF1, 149 HPA11/ 152 BBV FNU4H1, 156 NDE1/ 161 B 
BV FNU4H1, 163 ALUl/ 165 DDE1, 

LeuValLeuAsnProSerValAlaAlaThrLeuGlyPheGlyAlaTyrMetSerLysAla 
182 CTAGTACTCAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCAAGGCT 
GATCATGAGTTGGGGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGTTCCGA 

A AA A A AA 

182 MAEl/ 184 SCA1, 185 RSA1, 195 MNLl/ 203 BBV FNU4H1, 228 
AFL111 NSPC1, 229 NLA111/ 

Hi sG ly I leAspP roAsn IleAr gThrGlyValAr gThr I leThrThrGly Ser P roll e 
242 CATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCATC 
GTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGGGGTAG 

A .A****^^** A A A A A 

242 NLA111, 246 BIN1, 247 MBOl SAU3A, 248 CLA1, 249 TAQ1/ 25 
1 BIN1 MBOl SAU3A, 264 AVA2 SAU96, 267 HPA11 NCIl SCRF1, 271 
HPH, 291 BBV FNU4H1, 

ThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyrAsp 
302 ACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGAC 
TGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAATACTG 

302 MAJB2/ 304 RSA1, 340 BSP1286 HGIA, 343 AVA1, 350 HAE11, 3 
51 HHA1/ 

IlellelleCysAspGluCysHisSerThrAspAlaThrSerlleLeuGlylleGlyThr 
362 ATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATTGGCACT 
TATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAACCGTGA 

A AAA 

372 MAE3, 391 F0K1, 392 SFAN1, 399 F0K1/ 

ValLeuAspGlnAlaGluThrAlaGlyAlaArgLeuValValLeuAlaThrAlaThrPro 
4 22 GTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCCT 
CAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGGA 

A A A 

431 TTHIII2, 435 ALWN1, 461 BSP1286 HGIA, 479 MNLl/ 



# 

FIG. 79B 

ProGlySerValThrValProHisProAsnlleGluGluValAlaLeuSerThrThrGly 
482 CCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGGA 
GGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCCT 

A A A AAA AAA A 

482 HPA11 NCI1 SCRF1, 484 BAN11 BSP1286, 485 NLA1V, 491 MAE3 
, 497 BSP1286, 503 F0K1, 513 TAQ1, 515 MNL1, 518 MNL1, 537 H 
PA11/ 

GluIleProPheTyrGlyLysAlalleProLeuGluVallleLysGlyGlyArgHisLeu 
542 G AGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTC 
CTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGAG 

A A ^ A 

543 XH02, 544 BIN1 MB01 SAU3A, 571 MNL1, 573 TAQ1, 

IlePheCysHisSerLysLysLysC^sAspGluLeuAlaAlaLysLeuValAlaLeuGly 
602 ATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGGC 
TAG AAG ACAGTAAG TTTCTTCTTCACGCTGCTTG AGCGGCGTTTCG ACCAGCGTAACCCG - 

A A A A A 

603 MB Oil, 619 MB011, 638 FNU4H1, 645 ALU1, 660 SFAN1, 

IleAsnAlaValAlaTyxTyrArgGlyLeuAspValSerVallleProThrSerGlyAsp 
662 ATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGAT 
TAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCTA 

AA AA A A A A 

672 HAE1, 673 HAE111, 682 NSPB11 SAC2, 683 THAI, 693 AFL111 
MAE2, 703 F0K1, 712 NSPB11, 714 FNU4H1, 

ValValValValAlaThrAspAlaLeuMetThrGlyTyrThrGlyAspPheAspSerVal 
722 GTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGTG 
CAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCAC 

AAAA A AAA 

740 SFAN1, 745 MNL1, 748 NLA111, 753 HPAll, 762 HPAll, 771 T 
AQ1/ 773 HINF1, 778 HPH/ 

IleAspCysAsnThrCysValThrGlnThrValAspPheSerLeuAspProThrPheThr 
782 ATAGACTGCAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCTTCACC 
TATCTGACGTTATGCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGAAGTGG 

A AA A A 

794 AFL111 MAE2, 800 MAE3, 801 HPH, 813 TAQ1, 837 HPH, 

IleGluThrlleThrLeuProGlnAspAlaValSerArgThrGlnArgArgGlyArgThr 
842 ATTGAGACAATCACGCTCCCCCAAGATGCTGTCTCCCGCACTCAACGTCGGGGCAGGACT 
TAACTCTGTTAGTGCGAGGGGGTTCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCCTGA 

A A 

866 SFAN1/ 886 MAE2, 

GlyArgGlyLysProGlylleTyrArgPheValAlaProGlyGliiArgProSerGlyMet 
902 GGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGCATG 
CCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCGTAC 

A A A A AA AAA AA 

914 EC0R11 SCRF1, 918 SFAN1, 934 BAN1 NLA1V, 938 HPA11 NCI1 
SCRF1, 945 HAE11, 946 HHA1, 948 BGL1, 951 MNL1, 954 HP All, 9 
57 NSPC1, 958 NIA111, 

PheAspSerSerValLeuCysGluCysTyrAspAlaGlyCysAlaTrpTyrGluLeuThr 
962 TTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTCACG 
AAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAGTGC 

A A A A AA 

963 TAQ1, 965 HINF1, 976 MNL1, 992 HGA1, 1003 TTHIII2, 1013 
BAN11 BSP1286 HGIA SAC1, 1014 ALU1, 

ProAlaGluThrThrVaLArgLeuArgAlaTyrMetAsnThrProGlyLeuProValCys 
1022 CCCGCCGAGACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTGTGC 
GGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGCACACG 



FIG. 79C 



1051 RSA1, 1054 NLA111, 1063 AVAl NCI1 SCRF1 SMA1, 1064 HPA1 
1 NCI1 SCRF1, 1081 ECOR11 SCRF1, 

GlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeuThrHisIleAspAlaHis 
1082 CAGGACCATCTTGAATTTTGGG AGGGCGTCTTTACAGGCCTCACTCATATAGATGCCCAC 
GTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGGGTG 

^ A AA AA A A 

1084 AVA2 SAU96, 1103 MNL1, 1106 AHA11/ 1107 HGA1, 1117 HAE1 
STUl, 1118 HAE111, 1120 MNL1, 1133 SFAN1, 

Phel^uSerGlnThrLysGlnSerGlyGluAsnl^uProTyrl^uValAlaTyrGlnAla 
1142 TTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCC 
AAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAAT^ACCATCGCATGGTTCGG 

1183 ECOR11 SCRF1, 1192 RSA1, 1201 DRA3, 

ThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCysLeu 
1202 ACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTG* 
TGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACAAAC 

AAA A AA «•* "*"» 

1209 HHAl, 1212 MAE1, 1215 BAN 11 BSP1286, 1226 MNL1, 1239 NL 
A1V, 1240 AVA2 SAU96 , 1256 TTHIII2, 1261 HINF1, 

IleArgLeuLysProThrLeuHlsGlyProThrProLeuLeuTyrArgLeuGlyAlaVal 
1262 ATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTT 
TAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGACAA 

A A AA AA AA 

1267 MHL1, 1279 MNL1, 1282 NC01, 1283 NLA111, 1286 SAU96, 12 
87 HAE111, 1313 HAE11, 1314 HHAl, 

GlnAsnGluIleThrLeuThrHisProValThrLysTyrlleMetThrCysMetSerAla 
1322 CAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCGGCC 
GTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGCCGG 

0* A AA A AAA AA AA 

1332 HPH, 1339 HGA1, 1349 MAE3, 1350 HPH/ 1363 NIAlll, 1367 
NSPC1, 1368 NLAUl, 1369 AVA3 NSIl, 1371 NSPC1, 1372 NLAlll, 
1377 CFR1 XMA3 , 1378 HAE111, 

AspLeuGluValValThrSerThrTrpValLeuValGlyGlyValLeuAlaAlaLexiAla 
1382 GACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCC 
CTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAACCGG 

AA A A A A A A ~* A A A 

1384 EC0R11 SCRF1, 1385 GSU1, 1388 MNL1, 1394 MAE3, 1399 BSP 
1286 HGIA, 1404 EC0R11 SCRF1, 1409 BSP1286 HGIA, 1419 FNU4H1 
, 1421 AHA11, 1422 HGA1, 1426 EC0R11 SCRF1, 1430 BBV FNU4H1, 
1437 CFR1, 1438 HAE111, 1439 FNU4H1, 1441 THAI, 

AlaTyrCysLeuSerThrGlyCysValVallleValGlyArgValValLeuSerGlyLys 
1442 GCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAG 
CGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTC 

A A 

1453 HINC11, 1461 BBV FNU4H1, 1494 HPA11 NCI1 SCRP1/ 1501 NA 
El, 

ProAlallelleProAspArgGluValLeuTyrArgGluPheAspGluMetGluGluCys 
1502 CCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGC 
GGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTCACG 

A A ^ ** 

1502 HPA11, 1528 MNLl, 1542 TAQ1, 1553 MB011, 1558 BSP1286 H 
GIA, 

ScrGlnHisLeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPheLysGlnLya 
1562 TCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAG 
AGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAACTTCGTCTTC 

1563 DDE1, 1576 RSA1, 1581 TAQ1, 1590 FOK1, 1594 SFANl, 1612 




TTHIII2, 1621 HAE111 SAU96, FIG. 79D 

AlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaProAlaValGln 
1622 GCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCGAG 
CGGGAGCCGGAG^CGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTC 

1624 MNL1, 1628 HAE111, 1630 MNL1, 1634 PST1, 1639 TTHIII1, 
1642 THAI, 1643 HGA1, 1658 MNL1, 

ThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhelleSerGly 
1682 ACCAACTGGCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGG 
TGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCC 

1697 AVA1 XHOl, 1698 TAQ1, 1718 NDE1, 

IleGlnTyrl^uAlaGlyl^uSerThrLeuProGlyAsnProAlalleAlaSerLeuMet 
1742 ATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATG 
TATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTAC- 

1762 HINC11, 1768 BBV FNU4H1, 1772 ECOR11 SCRF1, 1775 BSTE2, 
1776 MAE3/ 

AlaPheThrAlaAlaValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsnlle 
1802 GCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATA 
CGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTAT 

«"» «"» A *"» A AAA 

1809 ALWN1 NSPB11 PVU11, 1810 ALU1, 1811 BBV FNU4H1, 1817 MA 
E3, 1818 HPH, 1836 MAE1, 1846 MNLl, 1849 MNL1, 1851 MB011, 

LeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheValGly 
1862 TTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGC 
AACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCG 

^ * A AA A A AA 

1877 BBV FNU4H1, 1884 ALU1, 1889 FNU4H1, 1895 NCI1 SCRF1, 18 
96 HPA11, 1898 BAN1 NLA1V, 1901 FNU4H1, 1919 HAE11, 1920 HHA 



AlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIlcAspIle 
1922 GCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATC 
CGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAG 

" ^ AA A AAA 

1927 DDE1, 1930 ALU1, 1934 AHA11 BAN1 HAE11 NAR1 NLA1V, 1935 
HHA1, 1937 FNU4H1, 1966 AVA2 SAU96, 1969 MNLl, 1978 F0K1, 

LeiiAlaGlyTyxGlyAlaGlyValAlaGlyAlaLeuValAlaPheLysIleMetSerGly 
1982 CTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGT 
GAACGTCCCAXACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCA 

AA AA A A A A 

1995 HHA1, 1996 THAI, 2010 BAN 11 BSP1286 HGIA SAC1, 2011 ALU 
1, 2021 BSM1, 2029 MBOl SAU3A, 2032 NLA111, 2039 HPH, 

GluValProSerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGlyAla 
204 2 GAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCC 
CTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGOGGTAGGAGAGCGGGCCTCGG 

A A A AA A A A AA AA A 

2042 MNL1, 2044 AVA2 NLA1V SAU96, 2049 MNL1, 2057 MML1, 2059 
AVA2 SAU96, 2060 TTHIII1, 2062 EC0R11 SCRF1, 2083 FOKl, 208 
6 MNLl, 2093 NCI1 SCRF1, 2094 HP All, 2096 NLA1V, 2097 BAN11 
BSP1286, 2101 MNL1/ 

LeuValValGlyValValCysAlaAlalleLeuArgArgHisValGlyProGlyGluGly 
2102 CTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGG 
GAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCC 

A A AA A A AA A 

2123 BBV FNU4H1, 2134 HHA1, 2136 NAE1, 2137 HPA11, 2142 MAE2 
, 2147 HAE111 SAU96, 2149 AVA1 NCI1 SCRF1 SMA1, 2150 HP All M 



f 



FIG. 79E 



CIl SCRFl, 2156 MNL1, 

AlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
2162 GCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCC 
CGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGG 

2172 F0K1, 2179 HPA11, 2196 MNL1, 2199 AVA1 NCI1 SCRFl SMA1, 
2200 HPA11 NCI1 SCRFl, 2205 NLA1V, 2210 NLA111, 
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10 20 30 40 

GAATTCGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATAT 

/SSp CTCTCCCAGGCGCCACTGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATAT 
550 560 570 580 590 .,6.00 

50 60 70 80 A 90 100 

AACAGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTAGT- 



AACGGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTGGT 
610 620 630 640 650 660 

110 120 130 140 150 160 

GGTAGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTG 

• ••••••■«•••■•■•■■•••••■••••••••■••••••••••••**•••**••*,* 

• ••••••••••••••••■•••••••••■*■•••••«•••••*••••••••••*•..• 

AATGGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTG 
670 680 690 700 710 720 

170 180 190 200 210 220 

GGGAGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCTTGGC 

•••••*••*•••••••••••••••••■•*«••*•••••*•*••■*•*•••••••• 

•••••••••****•••••••••*>•*••>•••••**•••*••••••■■■*■»««• 

GGGAGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGT 

730 740 750 760 770 780 

230 240 250 260 270 280 

AGTGCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACTGGGGGGATCGCCGC 

AGTGCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACCGGGGGAAGTGCCGG 
790 800 810 820 830 840 

290 300 310 320 330 340 

CAAAACTACGGCTAGCCTTACTGGTCTCTTCAATTTAGGTGCCAAGCAGAACATCCAGCT 

* * • • • a a ■ • ■ • • • • • * •■••■•■«• ••• ••••*•• 

* • ■ • ■ • ■ * •■• ■ * 

CCACACTGTGTCTGGATTTGTTAGCCTCCTCGCACCAGGCGCCAAGCAGAACGTCCAGCT 
850 860 870 880 890 900 

350 360 370 380 390 400 

GATCAACACCAACGGCAGTTGGCACATCAACAGGACGGCCTTGAACTGCAATGATAGCCT 
•••••••♦•♦••••••••••••••a «• ••««•• 

••*•••••••*••••••»*••••»• * • •••••■■••••■*««**•• 

GATCAACACCAACGGCAGTTGGCACCTCAATAGCACGGCCCTGAACTGCAATGATAGCCT 
910 920 930 940 950 960 

410 420 

CAACACCGGCTGGAATTC 
::::::::::: :X 

CAACACCGGCTGGTTGGCAGGGCTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCC 
970 980 990 1000 1010 1020 
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1) HCT #18 (USA) 

2) JH23 (USA) 

3) JH 27 (USA) 

4) PBL-Th (USA) 

5) EC1 (Italy) 

6) HCV-1 (chimpanzee) 

C/M s 



3 clones sequenced 
? 
? 

2 clones sequenced 

3 clones sequenced 
multiple 

(p) 



1) 
2) 
3) 
4) 

5) 

6)RNLGKVIDTLTCGFADLMGYIPLVGAPLGGAARALAHGVRVLEDGVNYATGNL 



1) H 

2) 

3) S T T 

4) L 

5) <F> S 

6) PGCSFSI FLLALLSCLTVPASAYQVRNSTGLYH VTN D C PN S S I VYE AAD A I LH 

1) (H) V V T 

2) A D V V K T 

3 ) S PVA N 

4) A ART 

5) H V T 

6) TPGCVPCVREGNASRCWVAMTPTVATRDGKLPATQLRRHIDLLVGSATLCS 



D 

2) I D 

3) D 
4) 

5) I 

6) ALYVGDLCX3SVFLVGQLFTFSPRRHWTTQGCNCSI 
SUMMARY: "S" AA117-308 (93%) 

HCT#18, PBL-Th, EC1 (Italy) have 97% homology with HCV-1 

JH23 and JH 27 have 96% and 95% homology with HCV-1 Respectively 



AA#300-438 ( C-terminal region of the putative envelope region and 
amino -1/3 of NSH 



1) JH23 

2) JH27 

3) Japanese isolate (T. Miyamura) 

4) EC10 (Italy) 



5) HCV-1 (chimpanzee) 



? 
? 
? 

2 clones sequenced 

(one nt difference, which did not 

result in an amino acid change) 

multiple 

S < i > NS I 



T 

A 
A 



1) D 

2) D 

3) VS VM V 

4) 

5)TTCK3CNCSIYPGHITGHRMAWDMMMNWSPTTALVMAQLLRIPQAILDMIAGA 



1 ) M R ARSTA VA 

2) T YT N AR TQALT ,F 

3) L Y I M GH R VQ VT TLT 

4) A I A K TASLTA 

5) H WGVLAG I AYFSMVGNWAKVLVVLLLFAGVDAETH VTGGSAGHTVSG FVSL 

1 )FS R I I TV 

2) FT Dl I R A D 

3) FR S Kl V I R Q F 

4) FNL I I R N 

5) LAPGAKQNVQLINTNGSWHLNSTALNCNDSLNTGWL 



SUMMARY: NS 1 AA 330-660 

"Isolate" %Homology (AA330-438) 



JH23 
JH27 

Japanese 
EC10 (Italy) 



83 
80 
73 
84 



^Homology (AA383-405) 

57 
39 
48 
48 
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5' terminus 

CACTCCACCATGAATCACTGCCCTGTGAGGAACTACT6TCTTCAC6CAGAAAGCGTCTAG 
CCATGGCGTTAGTATGAGTGTCGTGCAGCCTCCAGGACCCCCCCTCCCGGGAGAGCCATA 
GTGGTCTGCGGAACCGGTGAGTACACCGGAATTGCCAGGACGACCGGGTCCTTTCTTGGA 
TCAACCCGCTCAATGCCTGGAGATTTGGGCGTGCCCCCGCAAGACTGCTAGCCGAGTAGT 
GTTGGGTCGCGAAAGGCCTTGTGGTACTGCCTGATAGGGTGCTTGCGAGTGCCCCGGGAG-300 



(Putative initiator methionine codon) 
GTCTCGTAGACCGTGCACCA' 



G C 

TGAGCACGAATCCTAAACCTCAAAAAAAAAACAAACGTAA 
CACCAACCGTCGCCCACAGGACGTCAAGTTCCCGGGTGGCGGTCAGATCGTTGGTGGAGT 
TTACTTGTTGCCGCGCAGGGGCCCTAGATTGGGTGTGCGCGCGACGAGAAAGACTTCCGA 
GCGGTCGCAACCTCGAGGTAGACGTCAGCCTATCCCCAAGGCTCGTCGGCCCGAGGGCAG 
GACCTGGGCTCAGCCCGGGTACCCTTGGCCCCTCTATGGCAATGAGGGCTGCGGGTGGGC-600 
GGGATGGCTCCTGTCTCCCCGTGGCTCTCGGCCTAGCTGGGGCCCCACAGACCCCCGGCG 
TAGGTCGCGCAATTTGGGTAAGGTCATCGATACCCTTACGTGCGGCTTCGCCGACCTCAT 
GGGGTACATACCGCTCGTCGGCGCCCCTCTTGGAGGCGCTGCCAGGGCCCTGGCGCATGG 
CGTCCGGGTTCTGGAAGACGGCGTGAACTATGCAACAGGGAAGCTJCCTGGTTGCTCTTT 

C 

CTCTATCTTCCTTCTGGCCCTGCTCTCTTGCTTGACTGTGCCCGCTTCGGCCTACCAAGT-900 

GCGCAACTCCACGGGGCTTTACCACGTCACCAATGATTGCCCTAACTCGAGTATTGTGTA 

CGAGGCGGCCGATGCCATCCTGCACACTCCGGGGTGCGTCCCTTGCGTTCGTGAGGGCAA 

CGCCTCGAGGTGTTGGGTGGCGATGACCCCTACGGTGGCCACCAGGGATGGCAAACTCCC 

CGCGACGCAGCTTCGACGTCACATCGATCTGCTTGTCGGGAGCGCCACCCTCTGTTCGGC 

CCTCTACGTGGGGGACCTATGCGGGTCTGTCTTTCTTGTCGGCCAACTGTTCACCTTCTC-12O0 

TCCCAGGCGCCACTGGACGACGCAAGGTTGCAATTGCTCTATCTATCCCGGCCATATAAC 

G 

GGGTCACCGCATGGCATGGGATATGATGATGAACTGGTCCCCTACGACGGCGTTGGTAAT 

GGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTGGGG 

AGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGTAGT 

GCTGCTGCTATTTGCCGGCGTCGACGCGGAAACCCACGTCACCGGGGGAAGTGCCGGCCA-1500 

CACTGTGTCTGGATTTGTTAGCCTCCTCGCACCAGGCGCCAAGCAGAACGTCCAGCTGAT 

CAACACCAACGGCAGTTGGCACCTCAATAGCACGGCCCTGAACTGCAATGATAGCCTCAA 

CACCGGCTGGTTGGCAGGGCTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCCTGA 

GAGGCTAGCCAGCTGCCGACCCCTTACCGATTTTGACCAGGGCTGGGGCCCTATCAGTTA 

TGCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACCCCCCAAAACCTTG-1800 

CGGTATTGTGCCCGCGAAGAGTGTGTGTGGTCCGGTATATTGCTTCACTCCCAGCCCCGT 

GGTGGTGGGAACGACCGACAGGTCGGGCGCGCCCACCTACAGCTGGGGTGAAAATGATAC 

GGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGTTCGGTTGTACCTG 

GATGAACTCAACTGGATTCACCAAAGTGTGCGGAGCGCCTCCTTGTGTCATCGGAGGGGC 

GGGCAACAACACCCTGCACTGCCCCACTGATTGCTTCCGCAAGCATCCGGACGCCACATA-2100 

C 

CTCTCGGTGCGGCTCCGGTCCCTGGATCACACCCAGGTGCCTGGTCGACTACCCGTATAG 

GCTTTGGCATTATCCTTGTACCATCAACTACACCATATTTAAAATCAGGATGTACGTGGG 

AGGGGTCGAACACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCGAACGTTGCGATCT 

GGAAGACAGGGACAGGTCCGAGCTCAGCCCGTTACTGCTGACCACTACACAGTGGCAGGT 

CCTCCCGTGTTCCTTCACAACCCTACCAGCCTTGTCCACCGGCCTCATCCACCTCCACCA-24O0 

GAACATTGTGGACGTGCAGTACTTGTACGGGGTGGGGTCAAGCATCGCGTCCTGGGCCAT 

TAAGTGGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTG 

CTTGTGGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGAACCTCGTAATACT 

TAATGCAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTT 

TGCATGGTATTTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCTTCTACGGGATGTG-2700 

GCCTCTCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGT 

GGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATA 

TTACAAGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGA 

AGCGCAACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCAT 



FIG. 89B 

CTTACTCAT6TGTGCTGTACACCCGACTCTGGTATTTGACATCACCAAATTGCT6CTGGC-3000 

CGTCTTCGGACCCCTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCG 

CGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGT 

GCAAATGGTCATCATTAAGTTAGGGGCGCTTACTGGCACCTATGTTTATAACCATCTCAC 

TCCTCTTCGGGACTGGGCGCACAACGGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGT 

CGTCTTCTCCCAAATGGAGACCAAGCTCATCACGTGGGGGGCAGATACCGCCGCGTGCGG-3300 

TGACATCATCAACGGCTTGCCTGTTTCCGCCCGCAGGGGCCGGGAGATACTGCTCGGGCC 

AGCCGATGGAATGGTCTCCAAGGGGTGGAGGTTGCTGGCGCCCATCACGGCGTACGCCCA 

GCAGACAAGGGGCCTCCTAGGGTGCATAATCACCAGCCTAACTGGCCGGGACAAAAACCA 

AGTGGAGG6TGAGGTCCAGATTGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCAT 

CAATGGGGTGTGCTGGACTGTCTACCACGGGGCCGGAACGAGGACCATCGCGTCACCCAA-3600 

T 

GGGTCCTGTCATCCAGATGTATACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCC 
C 

GCAAGGTAGCCGCTCATTGACACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCAC 

GAGGCACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGGGCAGCCTGCTGTC 

GCCCCGGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGG " 

GCACGCCGTGGGCATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGGCGGTGGA-3900 

CTTTATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGTTCACGGATAACTC 

CTCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATGCTCCCACAGGCAG 

CGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTGCTAGTACT 

CAACCCCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCAAGGCTCATGGGAT 

T 

CGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTC-4200 

CACCTACGGCAAGTTCCTTGCCGACGGCGGGTGCTCGGGGGGCGCTTATGACATAATAAT 

TTGTGACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATCGGCACTGTCCTTGA 

CCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCCTCCGGGCTC 

CGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCACCGGAGAGATCCC 

TTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGGAGACATCTCATCTTCTG-A500 

TCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGGCATCAATGC 

CGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGATGTTGTCGT 

A 

CGTGGCAACCGATGCCCTCATGACCGGCTATACCGGCGACTTCGACTCGGTGATAGACTG 

CAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCTTCACCATTGAGAC 

AATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGGACTGGCAGGGG-4800 

GAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGCATGTTCGACTC 

GTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTCACGCCCGCCGA 

GACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCGTGTGCCAGGACCA 

TCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCCCACTTTCTATC 

CCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTG-5100 

CGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGTTTGATTCGCCT 

CAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTTCAGAATGA 

AATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCGGCCGACCTGGA 

GGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTG 

CCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCAAT-5400 

CATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCA 

CTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCGG 

CCTCCTGCAGACCGCGTCGCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTG 

GCAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATA 

CTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTAC-5700 

AGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATATTGGGGGG 

GTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTT 

AGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGG 

GTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCC 

CTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCCTCGTAGT-6000 

CGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAGGGGGCAGTGCA 

GTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTA 

CGTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTGTAAC 

CCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCACTCCATGCTCCGG 
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TTCCT66CTAA6GGACATCT6GGACTGGATATGC6AGGT6TTGAGCGACTTTAAGACCTG-6300 

GCTAAAAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCCTTTGT 

GTATAAGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCGCTGCCACTGTGGAGCTGA 

GATCACTGGACAT6TCAAAAACGGGACGATGAGGATCGTCGGTCCTAGGACCTG^ 

CATGT6GAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCTGTACCCCCCTTCC _ 

TGCGCCGAACTACACGTTCGGGCTATGGAGGGTGTCTGCAGAGGAATATGTGGAGATAAG-6600 

GCAGGTGGGGGACTTCCACTACGTGACGGGTATGACTACTGACAATCTCAAATGCCCGTG 

CCAGGTCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCCTACATAGGTTTGC 

GCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGGACTCCACGAATA 

CCCGGTAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCGTGTTGACGTCCAT ^ nnn 

GCTCACTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGTTGGCGAGGGGATC-6900 

ACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGCAAC 

TTGCACCGCTAACCATGACTCCCCTGATGCTGAGCTCATAGAGGCCAACCTCCTATGGAG 

GCAGGAGATGGGCGGCAACATCACCAGGGTTGAGTCAGAAAACAAAGT^ 

CTCCTTCGATCCGCTTGTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAAAT nnn 

CCTGCGGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGACTATAA-72Q0 

CCCCCCGCTAGTGGAGACGTGGAAAAAGCCCGACTACGAACCACCTGTGGTCCATGGCTG 

TCCGCTTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGCCTCGGAAGAAGCGGACGGTGGT 

CCTCACTGAATCAACCCTATCTACTGCCTTGGCCGAGCTCGCCACCAGAAGCTTTGGCAG 

CTCCTCAACTTCCGGCATTACGGGCGACAATACGACAACATCCTCTGAGCCCGCCCCTTC 

TGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCCCCCTGGAGGGGGA-7500 

GCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACGGTCAGTAGTGAGGCCAACGC 

GGAGGATGTCGTGTGCTGCTCAATGTCTTACTCTTGGACAGGCGCACTCGTCACCCCGTG 

CGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGTTGCTACGTCACCA 

CAATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAGAAGAAAGTCACATT 

TGACAGACTGCAAGTTCTGGACAGCCATTACCAGGACGTACTCAAGGAGGTTAAAGCAGC-7800 

GGCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGGAAGCTTGCAGCCTGACGCCCCC 

ACACTCAGCCAAATCCAAGTTTGGTTATGGGGCAAAAGACGTCCGTTGCCATGCCAGAAA 

GGCCGTAACCCACATCAACTCCGTGTGGAAAGACCTTCTGGAAGACAATGTAACACCAAT 

AGACACTACCATCATGGCTAAGAACGAGGTTTTCTGCGTTCAGCCTGAGAAGGGGGGTCG n , _ 

TAAGCCAGCTCGTCTCATCGTGTTCCCCGATCTGGGCGTGCGCGTGTGCGAAAAGATGGC-8100 

TTTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGATGGGAAGCTCCTACGGATTCCA 

ATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGTCCAAGAAAACCCC 

AATGGGGTTCTCGTATGATACCCGCTGCTTTGACTCCACAGTCACTGAGAGCGACATCCG 

TACGGAGGAGGCAATCTACCAATGTTGTGACCTCGACCCCCAAGCCCGCGTGGCCATCAA nn 

GTCCCTCACCGAGAGGCTTTATGTTGGGGGCCCTCTTACCAATTCAAGGGGGGAGAACTG-8A00 

CGGCTATCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTGGTAACACCCTCAC 

TTGCTACATCAAGGCCCGGGCAGCCTGTCGAGCCGCAGGGCTCCAGGACTGCACCATGCT 

CGTGTGTGGCGACGACTTAGTCGTTATCTGTGAAAGCGCGGGGGTCCAGGAGGACGCGGC 

GAGCCTGAGAGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCCCCCTGGGGACCCCCC n ^ n 

ACAACCAGAATACGACTTGGAGCTCATAACATCATGCTCCTCCAACGTGTCAGTCGCCCA-8700 

CGACGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAG 

AGCTGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTAGGCAACATAATCAT 

GTTTGCCCCCACACTGTGGGCGAGGATGATACTGATGACCCATTTCTTTAGCGTCCTTAT 

AGCCAGGGACCAGCTTGAACAGGCCCTCGATTGCGAGATCTACGGGGCaGCTAaCCAT nnnn 

AGAACCACTGGATCTACCTCCAATCATTCAAAGACTCCATGGCgCAGCGCATUTCACT-9000 

CCACAGTTACTCTCCAGGTGAAATTAATAGGGTGGCCGCATGCCTCAGAAAACTTGGGGT 

G 

ACCGCCCTTGCGAGCTTGGAGACACCGGGCCCGGAGCGTCCGCGCTAGGCTTCTGGCCAG 

AGGAGGCAGGGCTGCCATATGTGGCAAGTACCTCTTCAACTGGGCAGTAAGAACAAAGCT 

CAAACTCACTCCAATAGCGGCCGCTGGCCAGCTGGACTTGTCCGGaGGnCACGGa 

CTACAGCGGGGGAGACATTTATCACAGCGTGTCTCATGCCCGGCCCCGCTGGATCTGGTT-9300 

TTGCCTACTCCTGCTTGCTGCAGGGGTAGGCATCTACCTCCTCCCCAACCGATGAAGGTT 

GGGGTAAACACTCCGGCCT 3 ' te rm i nus 

Some clonal heterogeneities producing amino acid 
substitutions are shown. There are many other 
"silent mutations (not shown). 



FIG. 90A 

R T 

MSTNPKPQKKNKRNTNRRPQDVKFPGGGQIVGGVYLLPRRGPRLGVRATR 

KTSERSQPRGRRQPIPKARRPEGRTWAQPGYPWPLYGNEGCGWAGWLLSP-100 

RGSRPSWGPTDPRRRSRNLGKVIDTLTCGFADLMGYIPLVGAPLGGAARA 

LAHGVRVLEDGVNYATGNLPGCSFSIFLLALLSCLTVPASAYQVRNSTGL-200 
YHVTNDCPNSSIVYEAADAILHTPGCVPCVREGNASRCWVAMTPTVATRD _ 
GKLPATQLRRHIDLLVGSATLCSALYVGDLCGSVFLVGQLFTFSPRRHWT-300 

V 

TQGCNCS I YPGH I TGHRMAWDIWINWSPTTALVMAQLLRI PQA I LDMIAG 
AHWGVLAGIAYFSMVGNWAKVLWLLLFAGVDAETHVTGGSAGHTVSGFV-AOO 
SLLAPGAKQNVQL I NTNGSWHLNSTALNCNDSLNTGWLAGLFYHHKFNSS nnn <. 
GCPERLASCRPLTDFDQGWGP I SYANGSGPDQRPYCWHYPPKPCG I VPAK-500 
SVCGPVYCFTPSPVVVGTTDRSGAPTYSWGENDTDVFVLNNTRPPLGNWF 
GCTWMNSTGFTKVCGAPPCVIGGAGNNTLHCPTDCFRKHPDATYSRCGSG-600 

PWLTPRCLVDYPYRLWHYPCTINYTIFKIRMYVGGVEHRLEAACNWTRGE 
RCDLEDRDRSELSPLLLTTTQWQVLPCSFTTLPALSTGL I HLHQN VDVQ-700 
YLYGVGSSIASWAIKWEYVVLLFLLLADARVCSCLWMMLLISQAEAALEN 
LVILNAASLAGTHGLVSFLVFFCFAWYLKGKWVPGAVYTFYGMWPLLLLL-800 

(N) 

LALPQRAYALDTEVAASCGGVVLVGLMALTLSPYYKRYI SWCLWWLQYFL nnn 
TRVEAQLHVWIPPLNVRGGRDAVI LLMCAVHPTLVFDITKLLLAVFGPLW-900 
ILQASLLKVPYFVRyQGLLRFCALARKMIGGHYVQMynKLGALTGTYVY 
NHLTPLRDWAHNGLRDLAVAVEPVVFSQMETKLITWGADTAACGDI INGL-IOOO 
PVSARRGREI LLGPADGMVSKGWRLLAPITAYAQQTRGLLGCI ITSLTGR 
DKNQVEGEVQIVSTAAQTFLATCINGVCWTVYHGAGTRTI ASPKGPVIQM-llOO 

YTNVDQDLVGWPAPQGSRSLTPCTCGSSDLYLVTRHA^ 

SLLSPRPISYLKGSSGGPLLCPAGHAVGIFRAAVCTRGVAKAVDFIPVEN-1200 

LETTMRSPVFTDNSSPPVVPQSFQVAHLHAPTGSGKSTKVPAAYAAQGYK 

VLVLNPSVAATLGFGAYMSKAHGIDPNIRTGVRTITTGSPITYSTYGKFL-1300 

ADGGCSGGAYD 1 1 1 CDECHSTD ATS I LG I GTVLDQAETAGARL VVLATAT 

PPGSVTVPHPN I EEVALSTTGEI PFYGKA I PLEVI KGGRHL I FCHSKKKC-1400 
DELAAKLVALGINAVAYYRGLDVSVIPTSGDVVVVATDALMTGYTGDFDS 

Y (S) 
VIDCNTCVTQTVDFSLDPTFTIETITLPQDAVSRTQRRGRTGRGKPGIYR-1500 
FVAPGERPSGMFDSSVLCECYDAGCAWYELTPAETTVRLRAYMNTPGLPV 
CQDHLEFWEGVFTGLTHIDAHFLSQTKQSGENLPYLVAYQATVCARAQAP-1600 
PPSWDQMWKCL I RLKPTLHGPTPLLYRLGAVQNE ITLTHPVTKY I MTCMS ^ n 
ADLEVVTSTWVLVGGVLAALAAYCLSTGCVVI VGRVVLSGKPA 1 1 PDREV-1700 
LYREFDEMEECSQHLPYIEQGMMLAEQFKQKALGLLQTASRQAEVI APAV onn 
QTNWQKLETFWAKHMWNF I SG I QYLAGLSTLPGNPA I ASLMAFTAAVTSP-1800 
LTTSQTLLFN I LGGWVAAQLA APGAATAFVGAGLAGAA I GSVGLGKVL I D 
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(G) 

ILAGYGAGVAGALVAFKIMSGEVPSTEDLVNLLPAILSPGALVVGVVCAA-1900 

(HO 

ILRRHVGPGEGAVQWMNRLIAFASRGNHVSPTHYVPESDAAARVTAILSS 
LTVTQLLRRLHQW I SSECTTPCSGSWLRD I WDW I CEVLSDFKTWLKAKLM-2000 

(V) 

PQLPGIPFVSCQRGYKGVWRGDGIMHTRCHCGAEITGHVKNGTMRIVGPR 

TCRNMWSGTFPINAYTTGPCTPLPAPNYTFALWRVSAEEYVEIRQVGDFH-2100 

YVTGMTTDNLKCPCQVPSPEFFTELDGVRLHRFAPPCKPLLREEVSFRVG 

LHEYPVGSQLPCEPEPDVAVLTSMLTDPSHITAEAAGRRLARGSPPSVAS-2200 

SSASQLSAPSLKATCTANHDSPDAELIEANLLWRQENGGNITRVESENKV 

VI LDSFDPLVAEEDEREI SVPAEI LRKSRRFAQALPVWARPDYNPPLVET-2300 

(S) 

WKKPDYEPPVVHGCPLPPPKSPPVPPPRKKRTVVLTESTLSTALAELATR 

(FA) 

SFGSSSTSGITGDNTTTSSEPAPSGCPPDSDAESYSSMPPLEGEPGDPDL-2400 

SDGSWSTVSSEANAEDVVCCSMSYSWTGALVTPCAAEEQKLPINALSNSL 

LRHHNLVYSTTSRSACQRQKKVTFDRLQVLDSHYQDVLKEVKAAASKVKA-2500 

(F) 

NLLSVEEACSLTPPHSAKSKFGYGAKDVRCHARKAVTHINSVWKDLLEDN 

VTPIDTTIMAKNEVFCVQPEKGGRKPARLIVFPDLGVRVCEKMALYDVVT-2600 

KLPLAVMGSSYGFQYSPGQRVEFLVQAWKSKKTPMGFSYDTRCFDSTVTE 

(G) 

SDIRTEEAIYQCCDLDPQARVAIKSLTERLYVGGPLTNSRGENCGYRRCR-2700 
ASGVLTTSCGNTLTCYIKARAACRAAGLQDCTMLVCGDDLVVICESAGVQ 
EDAASLRAFTEAMTRYSAPPGDPPQPEYDLELITSCSSNVSVAHDGAGKR-2800 
VYYLTRDPTTPLARAAWETARHTFVNSWLGNI IMFAPTLWARMILMTHFF 
SVLIARDQLEQALDCEI YGACYSIEPLDLPPI IQRLHGLSAFSLHSYSPG-2900 

G 

EINRVAACLRKLGVPPLRAWRHRARSVRARLLARGGRAAICGKYLFNWAV 

(P) 

RTKLKLTPIAAAGQLDLSGWFTAGYSGGDI YHSVSHARPRWIWFCLLLLA-3000 
AGVGIYLLPNRO-3011 

Stop codon 



( ) = Heterogeneity due possibly 

to 5' or 3' terminal cloning 
artefact. 
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